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Preface 


Early  in  1990  a  scientific  committee  was  formed  for  the  purpose  of  organizing  a  high-level 
scientific  meeting  on  Future  Directions  of  Nonlinear  Dynamics  in  Physical  and  Biological 
Systems,  in  honor  of  Alwyn  Scott’s  60th  birthday  (December  25,  1991).  As  preparations  for 
the  meeting  proceeded,  they  were  met  with  an  unusually  broad-scale  and  high  level  of 
enthusiasm  on  the  part  of  the  international  nonlinear  science  community,  resulting  in  a 
participation  by  168  scientists  from  23  different  countries  in  the  conference,  which  was  held 
July  23  to  August  1  1992  at  the  Laboratory  of  Applied  Mathematical  Physics  and  the  Center 
for  Modelling,  Nonlinear  Dynamics  and  Irreversible  Thermodynamics  (MIDIT)  of  the 
Technical  University  of  Denmark.  During  the  meeting  about  50  lectures  and  100  posters 
were  presented  in  9  working  days. 

The  contributions  to  this  present  volume  have  been  grouped  into  the  following  chapters: 

1.  Solitons 

2.  Superconducting  devices 

3.  Biomolecular  dynamics 

4.  Nonlinear  optics 

5.  Classical  and  quantum  mechanical  lattice  dynamics, 

reflecting  some  of  the  many  different  areas  of  nonlinear  science  to  which  A1  Scott  has  made 
early  and  fundamental  contributions.  They  form  a  subset  of  the  total  number  of  the  papers 
and  posters  presented  at  the  meeting,  judged  by  the  editors  to  be  most  representative  of  a 
cross-section  of  Al’s  own  research.  Other  papers  from  the  meeting,  which  cover  an  even 
broader  spectrum,  have  been  selected  for  publication  in  “Future  Directions  of  Nonlinear 
Dynamics  in  Physical  and  Biological  Systems”,  NATO  Advanced  Study  Institute  Series,  to 
appear  with  Plenum  Press. 

People  who  have  worked  with  Al  know  that  he  insists  that  everyone  s.iould  do  what  he  or 
she  likes  to  do;  this  is  certainly  true,  but  it  is  also  true  that  from  the  very  early  days  he  has 
demonstrated  a  unique  ability  to  stimulate  research  in  new  directions.  Moreover,  his  ability 
to  interact  with  people  and  to  transmit  his  optimism  and  enthusiasm  to  those  around  him  has 
furnished  a  solid  launching  pad  for  these  new  lines  of  research.  Many  friends  and  colleagues 
around  the  world  are  thankful  to  Al  for  having  received  such  a  stimulus.  This  was  the 
essential  motivation  for  our  meeting  and  for  the  publication  of  these  Proceedings.  It  is  a 
small  way  of  saying  “Thank  you,  Al,  and  many  happy  returns”. 

It  is  our  pleasure  to  acknowledge  financial  support  from  the  NATO  Scientific  and 
Environmental  Affairs  Division  (grant  SA. 9-15-03),  the  Army  Research  Office  of  the  United 
States  Army  Laboratory  Command  (grant  29333  MA-CF),  the  United  States  Army  Euro¬ 
pean  Research  Office  (grant  RD  6891-MA-02),  the  National  Science  Foundation  (grant  ASI 
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910728),  the  Danish  Technical  Research  Council  (grant  16-4932-1  OS),  NORDITA,  the 
COWI  foundation  (grant  A-51.77/TJ/IJO),  and  The  Technical  University  of  Denmark 
(basic  research  grant  to  MIDIT). 

The  Scientific  Committee,  consisting  of  D.K.  Campbell,  Los  Alamos,  G.  Careri,  Rome, 
P.L.  Christiansen,  Lyngby,  A.S.  Davydov,  Kiev,  J.C.  Eilbeck,  Edinburgh,  A.  Luther, 
Copenhagen,  D.W.  McLaughlin,  Princeton,  A.C.  Newell,  Tucson,  and  R.D.  Parmentier, 
Salerno,  was  particularly  helpful  in  the  organization  of  the  scientific  program. 

We  warmly  thank  K.  Fisker,  L.  F0nss,  L.  Gudmandsen,  L.  MacNeil,  M.P.  Sdrensen  and 
many  students  at  MIDIT  for  their  generous  assistance  during  the  meeting  and  the  prepara¬ 
tion  of  these  Proceedings. 


Peter  Christiansen 
Chris  Eilbeck 
Bob  Parmentier 
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We  examine  a  variety  of  numerical  and  approximate  analytical  methods  to  study  families  of  solitary  waves  on 
lattices.  Such  waves,  when  they  exist,  travel  through  the  lattice  without  loss  of  energy,  and  have  approximate  soliton 
properties  on  collision.  Cotresponding  quantum  problems  are  also  briefly  described. 


1.  Introduction 


The  study  of  solitons  on  discrete  lattices  date 
back  to  the  early  days  of  soliton  theory.  The  fa¬ 
mous  paper  by  Fermi,  Pasta  and  Ulam  (FPU) 
which  inspired  the  pioneering  work  on  the  KdV 
problem  by  Kruskal  and  Zabusky  (cf.  [12]  for 
references )  was  a  numerical  study  of  a  crystal  lat¬ 
tice  model  with  an  anharmonic  interatomic  po¬ 
tential.  The  Hamiltonian  of  such  a  system  could 
be  written  as 


H  =  (1.1) 

n 

where  qn  is  the  displacement  of  the  nth  particle 
from  its  equilibrium  position,  and  V{q„+i-q„) 
is  the  anharmonic  interaction  potential.  In  the 
FPU  study,  V'(q)  =  yq  +  aq'”,  m  =  2  or  3. 
If  the  relative  displacement  of  the  nth  bond  is 
deflned  to  be  =  q„+  \-qn,  then  the  equation 
of  motion  becomes 


d^(/>n 

df2 


(1.2) 


A  continuum  approximation  to  the  FPU  prob¬ 
lem  leads  to  the  KdV  equation  (m  =  2)  or  the 
MKdV  equation  (m  =  3),  (see  [12]  for  de¬ 
tails).  The  analytic  solution  of  such  equations 
through  the  inverse  spectral  transform,  and  the 


associated  set  of  exact  soliton  properties,  is  now 
well  known. 

For  the  discrete  problem,  Toda  [31  ]  showed 
that  the  special  choice  of  F(<^)  =  a[l  - 
exp(-60)]  let  to  a  set  of  equations  which  also 
had  exact  multi-soliton  solutions.  Ablowitz  and 
Ladik  [  1  ]  discovered  another  integrable  lattice 
which  was  a  discrete  version  of  the  Nonlinear 
Schrodinger  equation,  and  a  number  of  other 
lattice  equations,  either  continuous  or  discrete 
in  time,  have  been  shown  to  have  this  property. 
However  the  basic  model  investigated  by  FPU 
is  not  integrable,  and  unfortunately  little  theory 
is  possible  once  this  property  is  lost. 

In  this  paper  we  survey  some  recent  investi¬ 
gations  of  nonintegrable  lattice  problems  which 
have  soliton  or  soliton-like  solutions  (wc  will  use 
the  word  ‘*soliton”  in  the  loose  sense  as  a  lo¬ 
calized  travelling  wave  which  survives  collision 
with  other  such  objects  unchanged,  or  almost  un¬ 
changed).  Such  studies  are  by  necessity  approx¬ 
imate,  although  we  present  both  numerical  and 
analytic  approximations  (here  by  “analytic  ap¬ 
proximation”  we  mean  a  simple  analytic  formula 
with  parameters  derived  by  numerically  or  ana¬ 
lytic  means). 

There  are  two  basic  numerical  approaches  we 
have  used  in  our  study.  One  is  to  look  for  trav¬ 
elling  waves,  i.e.,  solutions  of  the  form  <^n(0  = 
<f>{n  -  ct)  =  <i>(z),  with  <i)(z)  — ►  0  as  |z|  — >  cx3, 
which  leads  to  the  equation 
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Fig.  1.  Travelling  waves  for  the  case  V (q)  =  q  +  q^. 

+  F'(<A(z- D).  (1.3) 

We  solve  this  equation  using  pseudo-spectral 
methods  (17).  By  treating  c  as  a  parameter  we 
can  use  path-foliowing  methods  to  trace  out  the 
whole  curve  of  solutions  as  c  varies. 

In  order  to  study  the  stability  or  interactions 
of  such  solitary  wave  solutions,  or  to  study  more 
arbitrary  initial  conditions,  we  have  investigated 
a  class  of  symplectic  solvers  for  the  initial  value 
problem  (1.2)  [IS].  Some  typical  results  are  de¬ 
scribed  in  section  2. 

Another  alternative  approach  is  to  find  ana¬ 
lytic  approximations  to  travelling  waves,  either 
by  continuum  approximations  to  (1.3)  at  var¬ 
ious  levels  of  approximation,  i.e.,  low  c  expan¬ 
sions,  or  expansions  in  small  values  of  1/c,  or 
by  the  use  of  variational  or  weak  formulations 
to  fix  parameters  in  some  assumed  formula  for 
the  travelling  wave.  This  approach  is  described 
in  section  3. 


2.  Nnmerical  results 

Results  for  the  potential  V'(q)  =  q  ■¥  2q^  are 
described  in  [  1 7].  As  another  example  we  show 
in  fig.  1  some  results  for  the  potential  V'{q)  = 
q  +  q^.  Here  the  main  graph  shows  the  plot  of 
maximum  height  against  the  velocity  c,  for  c  in 


the  range  1  <  c  <  2.  The  inset  shows  the  corre¬ 
sponding  travelling  wave  profiles  for  three  points 
on  the  curve.  We  see  that  the  energy  of  the  wave 
is  concentrated  on  the  three  most  central  grid 
points.  Numerical  studies  with  these  profiles  as 
initial  conditions  shows  the  pulses  are  stable,  at 
least  for  medium  time  scales.  We  mention  in 
passing  that  a  similar  calculation  can  be  used  to 
investigate  periodic  solutions  of  ( 1 .3 )  or  indeed 
solitary  wave  or  periodic  solutions  of  continuous 
problems. 

A  related  equation  where  the  nonlinearity  is  in 
the  derivative  term,  arising  in  an  electrical  lat¬ 
tice  problem,  is  discussed  in  [19].  In  this  case 
the  curvature  of  the  apex  of  the  solitary  wave  ap¬ 
pears  to  blow  up  at  a  finite  value  of  the  ampli¬ 
tude,  i.e.,  the  tip  develops  a  cusp.  Unfortunately 
this  corresponds  to  a  parameter  value  for  which 
the  model  is  no  longer  physical.  It  would  be  in¬ 
teresting  to  find  this  effect  in  a  physically  realis¬ 
able  model. 


If  we  try  to  carry  out  the  same  calculation  for 
the  discrete  sine-Gordon  (DSG)  equation 

—  ^n+ 1  ~  "F  ^n+ 1  ~  sin  (/>«,  (2.1 ) 

we  were  unable  to  find  travelling  waves  with  the 
property  that  (f>{z)  0  (mod27r)  as  |z|  oo. 

The  best  we  could  find  is  shown  in  fig.  2,  where 
we  have  cut  out  the  central  part  of  the  kink  in  or¬ 
der  to  highlight  the  behaviour  of  the  tails.  It  can 
be  seen  that  the  tails  appear  to  be  periodic  waves 
with  a  small  constant  amplitude.  The  calculation 
is  set  up  in  such  a  way  that  the  tails  travel  at 
the  same  velocity  as  the  main  kink.  e  hope  to 
investigate  the  effect  of  varying  the  background 
wave  velocity  independently  of  the  kink  veloc¬ 
ity  in  the  future.  It  seems  that  the  medium  must 
be  ‘^pumped”  with  background  radiation  before 
the  kink  will  travel  in  a  lossless  manner.  This 
finding  is  consistent  with  the  results  of  Peyrard 
and  Kruskal  [26],  who  showed  that  a  kink  in 
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Fig.  2.  Travelling  kink  for  the  DSG  equation,  c  =  0.316228, 

r  =  1. 

a  medium  with  no  initial  background  radiation 
loses  energy  as  it  propagates.  If  the  velocity  of 
the  combined  wave  in  fig.  2  is  varied,  the  size  of 
the  background  radiation  increases  -  the  exam¬ 
ple  we  have  chosen  corresponds  to  a  minimum 
in  this  amplitude. 

Discrete  SG  breathers.  Instead  of  looking  for 
travelling  kink  solutions  of  the  DSG  equation 
(2.1),  we  can  look  for  stationary  breather  type 
solutions  of  the  form  (/>„(t)  =  4arctan  f{n,t). 
We  assume  periodic  boundary  conditions, 
f(n,t)  =  f{n  +  L,t)  and  consider  solutions 
which  are  even  in  both  space  and  time,  i.e.,  / 
can  be  expanded  as  a  double  Fourier  series 

f(n,t)  =  ^'^aij  cos  j(i)t.  (2.2) 

«  j 

One  simple  solution  of  this  type,  which  is  con¬ 
stant  in  n,  is  given  by  [25] 

fin,t)  =  Acd(i2t\m),  (2.3) 

where  cd  is  one  of  the  Jacobian  elliptic  functions 
[3]  and  where  Q  =  F/il  -i-  A^)  and  m  =  A^. 

Now  we  find  a  bifurcation  diagram  where  the 
localised  wave,  in  this  case  a  breather,  bifurcates 
from  a  simple  solution,  see  fig.  3.  The  numer¬ 
ical  path-following  suggests  that  breather  solu¬ 
tions  only  exist  when  F  is  sufficiently  small.  By 
using  an  accurate  and  efficient  symplectic  inte¬ 
gration  scheme  [5,1  S]  on  the  numerical  output 
from  the  path-following  code  it  is  confirmed  that 


Fig.  3.  Discrete  sine-Gordon  breather  bifurcating  from 
the  simple  solution  (2.3)  for  L  =  30  and  (o  =  0.9. 
A  =  max  /  (n,t). 


the  breather  appears  to  be  stable  and  does  not 
lose  energy  through  radiation. 

2.2.  The  discrete  nonlinear  Schrodinger 
equation 

The  discrete  NLS  equation  appears  in  sev¬ 
eral  applications  in  condensed  matter  physics 
(4,1 1,341  and  in  nonlinear  optics  (8].  We  con¬ 
sider  the  nonintegrable  discrete  NLS  equation 

+  <i>n-X  =  0  (2.4) 

with  periodic  boundary  conditions  (f>n+L  =  (t>n, 
where  L  is  the  number  of  lattice  points.  Hence, 
all  the  solutions  we  find  are  periodic  with  period 
L  .  For  large  L  we  can  expect  to  find  good  ap¬ 
proximations  to  solitary  waves  which  can  be  re¬ 
garded  as  periodic  waves  in  the  lim’ '  of  infinite 
period.  We  will  look  for  travelling  waves  of  the 
form  of  an  envelope  travelling  wave  modulating 
a  carrier  wave  travelling  at  a  different  speed,  i.e., 

(i>„(t)  =  <i>(n-ct) 

=  (^(z)  (2.5) 

where  c  is  the  speed  of  the  envelope  of  the 
travelling  wave.  The  periodic  boundary  condi¬ 
tion  <f>{z  +  L)  =  (f>{z)  requires  to  be  of  the 
form  k  ~.  InmlL  where  m  is  an  integer.  With 
the  ansatz  (2.5)  inserted  into  (2.4)  we  find 
that  <^(z)  must  satisfy  the  complex  nonlinear 
differential-advance-delay  equation 
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Fig.  4.  Paths  of  travelling  waves  in  the  discrete  NLS  equa¬ 
tion  (2.4)  for  L  =  40  and,  from  the  left,  (a)  c  =  0.01,  (b) 
c  =  1.2,  (c)  c  =  1.6.  Localized  (“solitary”)  waves  bifurcate 
from  stationary  solutions  (straight  lines),  but  only  seem  to 
exist  when  y  is  sufficiently  small  as  indicated  by  the  dotted 
line.  The  inset  shows  waveforms  along  the  c  =  1.2  path. 

-1-  y\<l)(z)\^<f>(z) 
e**  <t>{z  -1-  1 )  -F  e~‘*  (f>{z  -  1 ) 

=  0.  (2.6) 

Apart  from  the  trivial  solution  <^(z)  =  0  this 
equation  has  the  obvious  stationary  solution 
[18],  <^(z)  =  0  where  ^  is  a  (complex)  con¬ 
stant.  The  dispersion  relation  for  the  normalised 
(i.e.,  =  1)  stationary  solutions  is 

easily  found  as 

(o  =  -^-2cosk.  (2.7) 

The  stationary  solutions  are  interesting  in  this 
context  because  localised  travelling  (“solitary”) 
waves  bifurcate  from  them.  The  bifurcation 
points  can  be  found  analytically  using  standard 
bifurcation  theory  [21]. 

The  travelling  waves  which  bifurcate  from 
the  stationary  solutions  are  calculated  numer¬ 
ically  using  the  path-following  method  [17]. 
As  we  follow  the  path  away  from  the  bifurca¬ 
tion  point,  i.e.,  for  increasing  nonlinearity,  we 
see  how  the  travelling  wave  becomes  more  and 
more  localised  (cf.  fig  4)  until  (for  c  ^  0)  at 
some  point  the  path  stops. 

Until  the  path  stops  the  bifurcation  diagram  is 
very  similar  to  what  is  found  in  the  continuum 
limit  [35  ].  However,  in  the  discrete  case  we  also 
find  dark  solitary  waves  for  positive  y.  In  the 


continuum  limit  dark  solitary  waves  only  exist 
for  negative  y  [2,23]. 

If  we  consider  the  discrete  NLS  equation 
we  can  expect  to  find  dark  solitary  waves  for 
negative  y  (just  as  we  found  bright  solitary 
waves  for  positive  y).  Assume  that  <l>n(t)  = 
^(z)  exp[i(fcn  -  (ot)]  is  one  such  dark  solitary 
wave  for  y  =  -yo  <  0-  Then  it  is  easily  shown 
by  insertion  that  the  dark  solitary  wave 

^nit)  = 

X  exp^i{[(2fl  +  1  );r  -  A:]/i -F  to/}),  (2.8) 

where  q  is  an  integer,  satisfies  the  discrete  NLS 
equation  (2.4)  with  y  =  -Fyo- 

2.3.  Davydov’s  equations 

Davydov’s  equations  have  been  proposed  as  a 
model  for  propagation  of  vibrational  energy  in 
the  form  of  solitary  waves  in  a-helix  protein  and 
take  the  form  [10,11] 

(2.9) 

+  Xi\(f>n\^-\<f>n-l\^).  (2.10) 

Here  is  the  probability  of  finding  a  quan¬ 
tum  of  amide-I  vibrational  energy  at  site  n  and 
fin  is  the  longitudinal  displacement  of  the  nth 
peptide  group  of  the  protein.  /  ,  /,  A/  and  w  are 
physical  constants  for  which  we  will  use  the  es¬ 
timates  [30]  w  =  50  N/m,  Af  =  5.7  x  10“^^ 
kg  and  J  =  1.55  x  10“^^  J.  At  present  it  is  un¬ 
clear  what  value  x  should  take  [7]  although  ex¬ 
periments  [6,7]  suggest;!'  =35-62  pN. 

Note  that  under  the  “adiabatic”  approxima¬ 
tion,  d^finldt^  =  0,  eq.  (2.10)  is  satisfied  for 
fin+i  -  fin  =  -xl^nWw  and  eq.  (2.9)  reduces 
to  the  discrete  NLS  equation  (2.4)  with  y  = 
X^IJw  (and  /  replaced  by  the  scaled  time  Jt/h). 

With  the  above  choice  of  parameters  in  Davy¬ 
dov’s  equations  (2.9),  (2.10)  we  find  a  bifur- 
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cation  diagram  very  similar  to  that  found  for 
the  discrete  NLS  equation  [22];  stable  solitary 
waves  bifurcate  from  stationary  solutions  and 
the  paths  of  solitary  waves  stop  at  some  point 
as  X  is  increased.  For  small  values  of  c  this  hap¬ 
pens  for;t  «  120  pN.  We  also  find  dark  solitary 
waves. 

2.4.  2D  lattices 

Some  results  on  a  generalization  of  (1.2)  to 
isotropic  2D  lattices  is  given  in  [19].  An  inter¬ 
esting  anisotropic  lattice,  proposed  by  Zakharov, 
which  reduces  to  the  KP  equation  in  the  contin¬ 
uum  limit,  is  discussed  in  [  14]. 


3.  Analytic  approximations 

In  this  section  we  outline  various  analytic  ap¬ 
proximations  to  solitary  wave  solutions  of  the 
lattice  equation  (1.2).  We  start  with  the  travel¬ 
ling  wave  equation  (1.3)  obtinedfrom  (1.2)  by 
the  travelling  wave  ansatz.  For  illustration,  we 
consider  the  cubic  polynomial  potential 

=  (3.1) 

where  a  >  0  is  a  constant.  We  examine 
continuum-like  approximations  derived  from  a 
Fourier  transformed  version  of  eq.  (1.3)  and 
valid  in  various  wave  speed  ranges  and  one 
method  valid  in  the  high  speed,  large  amplitude 
limit.  We  also  describe  methods  based  on  vari¬ 
ational  and  weak  formulations  of  the  travelling 
wave  equation  (1.3)  which  are  valid  at  all  wave 
speeds. 


^Mk)F(k).  (3.2) 

where  g{k)  represents  the  Fourier  transform  of 
g(z)  and  we  use  the  abbreviated  notation  F{z) 
for  V'i<i>(z)).  Inserting  Taylor  series  or  Fade  ap¬ 
proximations  of  the  transformed  operator  A(k) 
in  the  equation  above,  we  then  invert  the  trans¬ 
form  to  obtain  standard  nonlinear  DDEs  for  the 
solution  0  of  (1.3). 

For  example,  using  the  (0,2)  Fade  approxima¬ 
tion  1  /  ( I  +  )  of  /I  (/c )  generates  the  second 

order  ODE 


cHl  -  =  F'(^), 

where  the  operator  D  =  d/dz.  This  has  as  a 
solution 

X  sech^  (3.3) 

using  the  cubic  potential  (3.1).  This  ODE  is  also 
obtained  via  an  approximation  of  the  original 
lattice  equation  (1.2)  by  the  regularised  or  im¬ 
proved  Boussinesq  equation  as  in  [28,29]  and 
by  a  very  different  approach  in  [9]. 

The  most  accurate  approximation  of  A(k) 
which  leads  to  a  tractible  ODE  is  the  (2,2)  Fade 
approximation 


A{k)  w 


^  ~ 

1  +  ^k^' 


which,  after  substitution  in  (3.2),  multiplication 
by  1  -I-  j^k^  and  inversion  of  the  transform,  gives 
the  ODE 


3. 1.  Continuum  approximations 

We  start  by  deriving  various  continuum  ap¬ 
proximations  from  a  Fourier  transformed  ver¬ 
sion  of  (1.3)  (see  [24]  for  example): 


c2(1-^D2),^=  (1  + 

where  the  differential  operator  D  =  d/dz.  An 
implicit  solution  of  the  ODE  can  be  found 
and  for  the  cubic  potential  (3.1 )  we  obtain  the 
speed-height  relationship 


6 


D.B.  Duncan  et  al.  /  Solitons  on  lattices 


Height  =  <^(0) 

=  ^[4c2-9  +  cv^5(14c2-9)] 

for  the  travelling  wave.  The  solution  <f>  is  found 
from 

2 

±  z  =  -7=tan~‘  & 

(2c^  +  3)  (  1  -  0^Pla<i>{Q)  \ 

*Vl  -^e^pla<i>(Q))' 

(3.4) 

where  the  constant  and  the  function  O  are 
defined  by 


(3.3),  we  see  that  this  approximation  has  given 
us  a  different  length  scale  for  the  waves. 

All  these  methods  have  been  applied  to  the 
two-dimensional  lattice,  the  KP  lattice  [14],  an 
electrical  transmission  lattice  of  Remoissenet 
and  Michaux  [27]  and  a  lattice  with  second, 
nearest  neighbour  interactions  [32].  For  more 
details  of  these  results  see  [33].  The  only  sig¬ 
nificant  obstacle  to  its  use  on  more  complicated 
lattice  equations  is  being  able  to  isolate  the 
operator  into  a  single  term  in  Fourier  space. 

3.2.  Asymptotics  of  fast  waves 


p  =  ^[9-4c2  +  c^5(14c2-9)], 


OHz) 


a[(i>(Q)  -  (l>(z)\ 
a<f){z)  +  P 


(3.5) 


For  fixed  z,  <^  ( z )  can  be  found  by  first  solving  for 
0  in  (3.4)  by  Newton’s  method  and  then  for  tf)  in 
(3.5).  Although  the  expressions  are  complicated, 
the  process  can  be  automated  quite  easily. 

The  approximations  so  far  derived  have  relied 
on  expanding  A(k)  for  small  k,  assuming  that 
the  waves  are  mainly  composed  of  the  smaller 
wave  numbers.  However,  for  fast  waves  we  ex¬ 
pect  the  pulse  to  become  narrow  and  so  to  have 
an  appreciable  component  from  higher  wave 
number  modes.  This  can  be  accommodated  by 
forming  an  approximation  of  A{k)  that  will 
be  globally  valid  in  k.  One  way  to  do  this  is  to 
match  an  approximation  2(k)  of  A{k)  at  the 
low  frequency  end  so  that  /1(0)  =  X(0)  and 
then  to  minimise  the  norm  of  the  difference 
j[||i4(^)  -  X(A:)||2).  Using  a  (0,2)  Fade  form 
X(k)  =  1/(1  +  aA:^),  the  L2  norm  is  minimised 
with  a  =  I  and  this  leads  to  the  ODE 


cH\-\D^)(t>=  V'(<i>) 
with  solution 

<t>(z)  =  2(c2_  1)  sech^  (3.6) 

Comparing  this  with  a  similar  solution  found 
earlier  using  the  (0,2)  Fade  approximation 


Much  has  been  written  about  the  asymptotic 
regime  c  1  (see  the  subsection  above).  Here 
the  solitary  waves  are  low  amplitude  and  slowly 
varying.  Continuum  approximations  are  appli¬ 
cable  and  results  are  often  easily  obtained  in 
terms  of  elementary  functions. 

However  in  the  other  limit,  c  -*  00,  very  little 
has  been  published.  A  simple  approximation 
has  been  provided  by  Druzhinin  and  Ostrovsky 
[13],  which  neglects  the  coupling  between  adja¬ 
cent  lattice  sites.  The  result  is  an  approximation 
in  terms  of  elementary  functions,  which  is  valid 
for  the  body  of  the  wave,  but  has  the  wrong 
asymptotic  behaviour  in  the  tail.  A  different  ap¬ 
proach  is  taken  in  [  14]  for  the  KP  lattice  where 
the  shape  of  the  high  speed  travelling  waves 
is  shown  to  be  determined  by  a  function  from 
the  class  Qp(z)  which  satisfy  the  differential- 
difference  equation 

Q';(z)=Qp{z  +  iy-2Qp{zr 

+  Qp{z-ir.  (3.7) 

This  equation  has  no  solutions  in  terms  of  ele¬ 
mentary  functions,  but  numerical  studies  indi¬ 
cate  that  it  is  even  and  pulse  shaped.  Numerical 
calculations  show  that  height  and  width  in  the 
first  two  cases  are  given  by 

//2«1.40,  1^2  «  1-44, 

7/3  a  1.30,  Wj  «  1.28. 
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We  now  apply  the  argument  in  [14]  to  the 
travelling  wave  equation  (1.3)  with  cubic  poten¬ 
tial  (3.1).  We  see  that  since  the  height  of  the 
travelling  wave  increases  with  speed  c,  the  po¬ 
tential  is  dominated  by  the  cubic  term  at  high 
speed  and  equation  (1.3)  can  be  approximated 
by 

^,^"(z)  =  ,i>Hz  +  l)-2<f>Hz)  +  (l>Hz-l)  . 

The  solution  of  this  equation  is  related  to  the 
solution  Qp  of  (3.7)  by  the  transformation 


to  enable  us  to  carry  out  integrations  explicitly. 
We  use 

X  =  {g(z)  =  £tanh(Fz);£:,F€R+}, 

and  note  that  we  are  working  with  the  variable 
g{z)  which  is  defined  in  terms  of  q„  (t)  appear¬ 
ing  in  (1.1)  using  the  travelling  wave  ansatz 
q„it)  =  q{n  -  ct)  =  qiz).  The  function  q(z) 
is  related  to  i^(z)  by 

<l>iz)=q{z+l)-q{z),  (3.9) 


<f>{z)  =  ^Qpiz)  .  (3.8) 

As  a  consequence,  the  height  of  the  pulse  0  is 
prediced  to  be  approximately  lAc^/a  at  high 
speed  and  the  width  is  fixed  at  1 .44  lattice  spac- 
ings. 

3.3.  Variational  formulation 


A  different  approach  to  forming  analytic  ap¬ 
proximations  is  to  make  use  of  the  Lagrangian 
structure  of  the  equations.  The  Lagrangian  for¬ 
mulation  gives  us  an  extremum  principle  for  the 
solution  of  a  system  of  equations.  From  the  ki¬ 
netic  (T(q))  and  potential  ( F  (^ ) )  Enei^es  we 
get  the  Lagrangian,  L  (9,9)  =  T- F.  Equations 
of  motion  are  then  generated  by  application  of 
the  Euler-Lagrange  equations. 


A  f^) 

dt  \dq„  / 


In  this  derivation,  however,  we  have  skipped 
over  an  important  point,  that  the  first  variation 
of  the  action  integral,  I  =  J  L  vanishes  (<5/  = 
(5  /  L  =  0).  It  is  this  part  of  the  theory  which  we 
exploit. 

In  a  fashion  reminiscent  of  finite  element  tech¬ 
niques,  we  shall  seek  such  a  critical  point  over 
a  restricted  space  of  functions  X,  for  the  travel¬ 
ling  wave  function,  q{z)  e  X.  We  pick  the  space 
X  to  match  important  qualitative  features  of  the 
solution  such  as  symmetry  and  shape  and  also 


and  forms  a  kink  soliton,  as  opposed  the  pulse 
shape  of  <f>.  This  restriction  now  means  that  I  is 
a  function  of  E  and  F,  so  that  setting  the  first 
variation  of  I  to  zero  corresponds  to  the  coupled, 
nonlinear  algebraic  equations 


As  might  be  expected,  the  algebra  is  messy. 
Using  an  algebraic  manipulation  package  we  can 
obtain  a  relationship  between  E  and  c  and  F  and 
c  and  we  can  extract  information  for  any  speed  of 
solitary  wave  using  numerical  routines.  Secondly 
we  can  use  asymptotic  analysis  to  investigate  the 
two  limits  c  — »  1  and  c  — ►  cx).  Both  of  these 
have  been  carried  out  in  detail  in  [16].  For  the 
cubic  potential  with  1  we  recover  all  of  the 
continuum  limit  results,  since  <^(z)  =  q(,z  + 
1)  -  qiz)  asymptotes  to  a  sech^  pulse.  In  the 
high  speed  region,  the  expressions  for  height  and 
width 


//3  =  (c^  -  1 )  [  1 .404  870  +  0.059 159c-^ 

+  0(c-‘»)],  (3.10) 

IFj  =  [  I  -433  565  -  0.346  955c-^ 

+  Oic-*)]  (3.11) 

show  agreement  to  within  about  2%  of  the 
Qpiz)  functions  described  earlier  and  graphs  of 
the  waveform  also  demonstrate  a  good  fit. 
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3.4.  Weak  formulation 

We  now  describe  a  simpler  method  for  the 
approximation  of  the  travelling  wave  equation 
(1.3)  which  uses  a  weak  formulation  of  the  prob¬ 
lem  and  can  be  shown  to  be  closely  related  to  the 
variational  formulation  described  above.  More 
details  can  be  found  in  [16].  We  multiply  (1.3) 
by  a  smooth  test  function  integrate  with 
respect  to  z  and  use  integration  by  parts  (assum¬ 
ing  appropriate  conditions  on  ( z )  and  ( z )  as 
|z|  -»  oo)  to  obtain 

OO 

j  i//"(z)4){z)dz 

—  OO 

OO 

=  J  V'icftiz)) 

—  OO 

X  [ti/{z  \  )  -  (z)  -V  {z  -  \  )\  dz. 

(3.12) 

This  equation  is  satisfied  by  any  resonably  well 
behaved  test  function  ( z )  when  (f>{z)  is  a  solu¬ 
tion  of  (1.3).  when  <f>iz)  is  a  solution  of  (1.3). 

Now  we  look  for  an  approximate  solution  of 
(1.3)  for  the  cubic  potential  (3.1).  We  use  the 
form 

(f>{z)  =  A  sech^(/?z)  (3.13) 

because  it  has  the  expected  symmetric  pulse 
shape  and  satisiies  ( 1.3)  in  the  continuum  limit. 
We  substitute  this  approximate  solution  in  the 
weak  form  (3.12),  select  two  simple  test  func¬ 
tions  {z^,  z^}  and  fix  the  constants  A,  by  solv¬ 
ing  the  resulting  algebraic  equations.  We  find 
that 


and 


=  3  (^)  •  (3.14) 

The  behaviour  as  c  — »  1  is  in  exact  agreement 
with  the  approximations  (3.3),  (3.4)  and  (3.6). 


Unfortunately,  numerical  tests  using  this  ap¬ 
proximate  solution  as  intial  data  show  that  it 
only  accurate  in  producing  travelling  waves  up 
to  speed  1.7  and  fails  to  produce  one  after  that. 
Further  numerical  tests  using  a  three  parameter 
approximation 

<f>{z)  =  A  sech"()?z) 

for  /I  =  4, 6, . . .  show  that  n  must  be  increased  to 
obtain  good  results  at  higher  speeds.  It  is  possible 
to  obtain  an  optimal  value  of  n  for  given  speed  c 
by  solving  the  algbraic  equations  obtained  from 
the  weak  form  (3.12)  with  three  test  functions 
{z^,  z^,  z^}.  This  is  quite  messy  and  valid  only 
up  to  c  «  2.5  where  n  oo  and,  taking  this  limit 
into  account,  the  approximate  solution  is 

4>(z)  =  a/2(c^  -  1 )  exp 

(3.15) 

for  all  higher  speeds.  The  numerical  tests  in¬ 
dicate  that  this  Gaussian  approximate  solution 
gives  very  good  results  at  high  speeds  and  that  it 
is  also  reasonably  good  at  low  speeds. 

This  technique  is  easy  to  use,  particularly 
when  the  approximate  solution  <l>  and  test  func¬ 
tions  y/  are  chosen  such  that  the  integrals  in  the 
weak  form  can  be  obtained  easily. 

3.5.  Numerical  results 

One  way  to  evaluate  the  accuracy  of  the  var¬ 
ious  solutions  given  above  is  to  compare  the 
speed-height  and  width-height  relationships 
for  the  travelling  waves  they  predict  with  those 
found  by  direct  numerical  approximation  as  de¬ 
scribed  in  section  1 .  The  results  for  the  approx¬ 
imations  described  in  the  previous  sub-sections 
are  shown  in  table  1 .  We  see  that  many  methods 
agree  in  the  low  speed  (c  «  1 )  region  since  they 
reproduce  correctly  the  simple  sech^  solitary 
wave  predicted  by  continuum  theory.  In  the 
high  speed  limit,  we  see  that  thf'  variational  ap¬ 
proximation  (3.9)  is  very  close  to  the  “correct” 
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Table  1 

Relationship  between  height,  width  and  speed  predicted  by  the  methods  of  this  section.  Width  refers  to  the  half  height 
width  of  the  pulse.  The  last  three  rows  give  the  leading  order  behaviour  for  c^  — >  1  and  -»  oo  as  appropriate.  We  use  the 
function  R^{c^)  =  c^/(c^  -  1). 


Solution/Type 

Amplitude 

Width 

low  speed 

high  speed 

low  speed 

high  speed 

(3.3)/Padc 

1.5  -  1) 

1.0177 /{(c2) 

(3.6)/Pade 

1.5  (c2-  1) 

1.7627 /{(<r2) 

(3.l3)/weak 

1.5  (c^  -  1) 

1.0177/{(c2) 

(3.15)/weak 

>/2  -  1 ) 

1.3596/?(c2) 

(3.4)/Pade 

1.5  (c2  -  1) 

1.3741  c2 

1.0177R(c2) 

1.4336  R(c2) 

(3.8)/fast 

not  valid 

1.40^2 

not  valid 

1.44 

(3.9)/variational 

1.5  (^2-  1) 

1.4049<r2 

1.0177/{(c2) 

1.3590 /?(c2) 

value  1  AOc^  found  by  accurate  direct  numerical 
calculation  using  spectral  methods  reported  in 
section  1.  Also,  we  see  that  the  pulse  width  is 
bounded  below  at  high  speeds  and  the  approxi¬ 
mate  solutions  have  significant  size  over  only  a 
few  lattice  spacings. 

Another  test  of  the  methods  is  to  use  the  solu¬ 
tions  they  predict  as  initial  data  in  the  ODE  sys¬ 
tem  (1.2)  and  measure  the  proportion  of  energy 
carried  by  any  resulting  travelling  wave  form  if 
the  solution  settles  down.  At  low  speeds  ( 1  < 
c  <  1.7),  all  the  approximations  which  repro¬ 
duce  the  continuum  results  (3.3),  (3.4),  (3.9) 
and  (3.13)  produce  very  clean  travelling  waves 
carrying  most  of  the  input  energy.  The  Fade  ap¬ 
proximation  (3.6)  and  the  weak  form  approxi¬ 
mation  (3.15)  are  designed  for  high  speeds,  but 
perform  reasonably  well  at  these  low  speeds.  At 
speeds  above  about  1 .7  some  of  the  methods  fail 
completely,  but  the  approximations  (3.4),  (3.6), 
(3.8),  (3.9)  and  (3.15)  are  all  designed  to  work 
at  these  speeds  and  perform  well. 


4.  Quantum  problems 

In  a  volume  dedicated  to  Alwyn  Scott,  we 
would  be  remiss  to  leave  the  topic  of  solitons 


on  lattices  without  mentioning  briefly  recent 
developments  in  the  study  of  the  corresponding 
quantum  version  of  these  problems  [20].  For 
example,  the  second  quantised  version  of  the 
Hamiltonian  for  the  DNLS  equation  (2.4)  is 


/ 

+  b]bj.^  A  \7b]b]bjbi), 
(4.1) 


where  b]  and  bj  are  bosonic  raising  and  lowering 
operators  satisfying  the  usual  commutation  rela¬ 
tions.  If  we  calculate  the  energy  levels  on  a  peri¬ 
odic  lattice  of  length  /  =  60,  with  two  quanta 
(the  smallest  non-trivial  case),  the  distribution 
of  energy  eigenvalues  as  a  function  of  momen¬ 
tum  k  looks  like  fig.  5.  The  distinct  band  be¬ 
low  the  quasi-continuum  is  the  so-called  soliton 
band.  A  state  in  this  band  has  a  high  probability 
of  having  both  quanta  on  the  same  site,  hence 
the  name  soliton  as  a  localized  lump  of  energy. 
However,  this  being  quantum  mechanics,  there 
is  equal  probability  of  finding  this  double  quanta 
on  any  site  of  the  lattice. 
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Fig.  S.  Quantum  energy  levels  for  the  DNLS  equation. 
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A  heuristic  estimate  for  the  soliton  production  rate  by  a  pulse  is  verified  for  the  Korteweg  -  de  Vries  equation  using 
inverse  scattering.  An  observation  from  this  result,  which  is  shown  to  hold  for  some  other  nonlinear  equations  and  for  the 
case  of  the  “forced"  nonlinear  Schrddinger  equation,  is  that  production  is  determined  by  quantities  that  are  invariant 
under  rescaling  of  the  original  nonlinear  equations.  We  speculate  that  this  result  may  be  useful  to  the  development  of  an 
inverse  scattering  theory  for  “forced"  nonlinear  systems. 


1.  Introduction 

A  heuristic  method  for  determining  the  num¬ 
ber  of  Korteweg -de  Vries  (KdV)  solitons  pro¬ 
duced  in  an  experiment  has  recently  been  re¬ 
ported  [1].  That  method  was  verified  numerically 
by  testing  it  on  the  Toda  lattice,  which  becomes 
a  KdV  in  a  continuum  limit. 

The  method  arises  in  noting  that  for  KdV 
solitons,  the  product  WVA  =  constant,  where  A 
is  the  soliton  amplitude  and  W  its  width.  Sup¬ 
pose  one  now  applies  to  the  medium  an  excita¬ 
tion  potential  amplitude  B,  width  L.  First  com¬ 
pute  the  integer  N  given  by 

N  =  iLVB/wVA) ,  (1) 

where  for  the  real  number  R,  (R)  denotes  the 
integer  part  of  R.  For  example,  if  R  is  written  as 
the  decimal  form  R  =  N.abc.  .  .  ,(R)  =  N.  Then 
eq.  (1)  gives  the  number  of  whole  boxes  of  area 
WVa  fitting  in  the  box  of  area  LVB.  Finally, 
remembering  that  the  solitons  are  nonoverlap¬ 
ping,  the  soliton  production  rule  is 

number  of  solitons  excited  =  N  +  \ .  (2) 


If  a  fraction  of  a  box  of  area  WVA  appears  in 
computing  (1),  i.e.  the  portion  O.abc. . .  of  R 
above,  this  fraction  is  taken  to  produce  “radia¬ 
tion  rather  than  another  soliton’’.  This  intuitive 
way  of  thinking  of  solitons  arising  from  a  pulse 
lacks  completely  any  theoretical  basis,  but  the 
accuracy  of  (2)  has  nonetheless  been  verified 
numerically  [1], 

General  solutions  to  soliton  equations  may  be 
obtained  by  the  method  of  the  inverse  scattering 
transform  [4,5].  The  nonlinear  problem  pre¬ 
sented  by  the  soliton  equation  is  mapped  to  a 
linear  scattering  problem,  for  which  the  solution 
of  the  nonlinear  equation  serves  as  the  scattering 
potential.  The  scattering  problem  is  solved  by 
standard  linear  techniques,  and  the  subsequent 
inverse  mapping  back  to  the  nonlinear  problem' 
provides  its  solution.  This  consists  of  a  number 
of  solitons  plus  radiation,  the  solitons  corre¬ 
sponding  to  the  bound  states  of  the  scattering 
problem,  the  radiation  to  the  continuum  (scatter¬ 
ing)  states. 

We  wish  now  to  establish  a  connection  be¬ 
tween  the  production  rule  (2)  and  inverse  scat¬ 
tering.  It  is  our  intention  in  this  note  to  suggest 
that  the  success  of  (1)  and  (2),  which  after  all 
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amount  to  no  more  than  a  simple  minded  scaling 
argument,  may  actually  rest  on  a  deeper,  mathe¬ 
matically  rigorous  property  of  soliton  equations. 
We  have  been  unable  to  discover  this  conjec¬ 
tured  rule  as  yet. 

Toward  our  goal,  we  present  in  section  2  a 
number  of  scattering  problems  for  soliton  equa¬ 
tions  indicating  a  connection  between  the  bound 
state  spectrum  (obtained  from  the  associated 
linear  scattering  problem)  and  the  scaling  prop¬ 
erties  of  the  original  nonlinear  soliton  equation. 
In  particular,  eqs.  (1)  and  (2)  are  verified  for  the 
case  of  the  KdV  equation.  A  possible  application 
of  the  present  observations  to  the  problem  of 
“forced  integrable  systems”  [6]  is  discussed  in 
our  conclusions  in  section  3. 


2.  Some  examples 

We  consider  first  the  KdV  equation 

d,q{x,  0  +  (iqd^q  +  d^q^O,  (3a) 

with  the  initial  condition 

q{x,0)  =  (rVg,  -a^x<a 

=  0 ,  |x|  >  a  .  (3b) 

The  sign  a  =  ±l  is  present  for  generality.  Asso¬ 
ciated  with  (3)  is  the  linear  scattering  problem 

[4] 

d^Vi+iiv^=q{x,0)v2, 

(4) 

for  the  eigenvalues  In  general,  ^  is  complex, 

f  =  ^  +  iT? .  (5) 

Solitons  produced  by  (3b)  are  determined  by  the 
bound  state  spectrum  of  (4)  in  the  upper  half 
^-plane,  i.e.,  t7>0  [4].  Such  states  vanish  as 


|jc|— >00.  With  v  =  (Vi,V2)  and  A,  B  arbitrary 
amplitudes,  we  obtain  from  (4)  the  solutions 

=  x>a; 

=  x<-a,  (6) 

which  have  the  appropriate  bound  state  behavior 
as  ljt|— »oo.  We  also  take  ^  =  0  for  the  bound 
case.  This  is  done  because  we  are  aware  be¬ 
forehand  that  u,  may  be  eliminated  from  (4), 
leaving  the  quantum  mechanical  Schrodinger 
equation  for  Uj  with  eigenvalues  f  This  Hermi- 
tean  operator  permits  only  real  so  that  ( 
corresponding  to  bound  states  is  purely  im¬ 
aginary. 

For  |jf|  ^  a,  insertion  of 

V  =  {r„r2)e^^^  (7) 

in  (4)  yields  the  condition 

r2ir,=\l(y-i)  =  \{y  +  C)laV,.  (8) 

With 

y  =  K  -t-  ip  ,  (9) 

we  obtain  the  two  solutions 
-V^  +  o-Vp 

P±  =  , 

y^  =  K^  +  ip^  .  (10) 

As  in  (8),  =  -i(7*  -  Or'2±  so  that 

=  -i(y.  -  Or2.p‘’^^  -i(y-  -  , 

v2  =  r2,p'^*^  +  r2.p'^-’‘  ■  (11) 

Continuity  of  the  solution  determines  the  match¬ 
ing  conditions 
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4t{x  =  +a)  =  vix  =  +a), 

0(x  =  -a)  =  v{x  =  -a)  . 

With  a  little  algebra  this  gives 

(12) 

(13) 

Finally,  recalling  ^  =  0  for  bound 
letting 

states  and 

X  =  K+a  ,  y  =  Tja  , 

(14) 

equating  real  and  imaginary  parts  of  (13)  gives 

-I-:  A' tan  A'=y, 

(15a) 

A'cosA'=-y. 

(15b) 

Also,  in  either  case,  from  (10), 

X^  +  Y^  =  a\(rV„)3:0 . 

(16) 

Using  (8)  we  find  that  for  (15a),  =  r2_,  an 

even  solution,  while  for  (15b)  an 

odd  solution. 

As  previously  stated,  upon  eliminating  Uj  from 
(4),  one  derives  the  quantum  mechanical  Schro- 
dinger  equation  for  the  potential  -qix,  0).  For  a 
different  symmetric  potential,  the  conditions 
(15),  (16)  have  been  obtained  and  solved  graphi¬ 
cally  [7].  The  results  is  that 

number  of  bound  states  =  +  1 

if 

Nir  <  2a\ja-VQ  <{N  +  1)17 . 

(17) 

With  the  correspondence  WV4— »Tr  and  LVB  = 
2aVlK)l»  this  is  just  the  result  (1),  (2).  We  note 
here  that  the  one  soliton  solution  ()  of  (3a) 

q,(x,  t)  =  (2 1 W^)  sech^([jc  -  4//W^]/W) 

obeys 


j  dx  VqJxTi)  =  ttV2  . 

—  OB 

If  we  substitute  Y  from  (15)  into  (16)  and 
consider  the  limit  X-*0,  we  find  with  (15a)  that 
Y~~  X^>0  and  X  ==  aViVo|.  Thus  this  solution, 
the  even  case,  exists  for  arbitrarily  small  Vq.  Tliis 
is  the  origin  of  the  additional  term  of  unity 
added  to  N  in  (2).  There  is  no  threshold  con¬ 
dition.  For  the  odd  case,  (15b),  this  limit  gives 
for  A’-»0  (but  ^9^0),  l  =  aVil^ol-  There  is  a 
threshold  for  the  odd  case. 

Considering  next  the  invariance  of  (3a)  under 
the  scaling  transformation 

x'  =  e‘‘x,  =  (18a) 

we  find 

6  =  3a,  c=-2a,  (18b) 

so  that 

xVq  =  x'V^.  (18c) 

This  invariant  combination  reflecting  shape  pre¬ 
servation  under  (18a)  by  the  nonlinear  differen¬ 
tial  equation  is  interesting  if  we  make  the  as¬ 
sociation  of  X  with  2a  and  q  with  crVo  =  |Vo|.  To 
the  extent  that  this  purely  dimensional  associa¬ 
tion  actually  means  something,  the  invariant 
quantity  in  (18c)  is  seen  by  (17)  to  be  the 
determiner  of  soliton  production.  The  soliton 
spectrum  was  found  from  the  linear  scattering 
problem  using  the  initial  condition  for  the  non¬ 
linear  equation.  We  will  return  to  this  shortly. 

We  consider  next  the  nonlinear  Schrodinger 
equation  (NLSE) 

id,q{x,t)+  dlq±2q\q\^  =  0,  (19) 

for  which  the  associated  scattering  problem  is 
+  ifu,  =  qv2 , 

d,V2-i(v2  =  +q*v^.  (20) 
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Again,  with  cr  =  ±1,  let 
q{x,Q)  =  aVo,  -a<jcsa, 

=  0,  k|>a-  (21) 

For  bound  state  solutions  with  ( in  the  upper  half 
plane,  solutions  are 

=  (0, dp'^'‘)  ,  x>a  -, 

(p  =  ,0) ,  x<-a  .  (22) 

For  |jt|  a,  use  of  the  form  (7)  gives 

r,/r,  =  crVKy  +  o  =  Kr  -  f)/(^‘^v;) , 

(23) 

while  use  of  (5),  (9)  yield 

+  ([±|V'or  +  ^^-T,Y  +  4(^)T^]}‘'^ 

p*  = 

•y*  =  Kj.  +  ip*  .  (24) 

Again  we  use  the  conditions  (12).  Then  = 
-7+  results  in 

(-y.  +  ne"‘’'^“  =  (r.  +  Oe-^‘’'^‘’  (25) 

analogous  to  (13).  Defining 
X  =  2aK^,  P  =  2a^, 

r  =  2ap+  ,  Y  =  2ai7  ,  (26) 

we  obtain  upon  equating  real  and  imaginary 
parts  of  (25) 

cos  X  [e"^  (AT  +  p)  -  e'"^  {-X  +  p)] 

=  -sinA'[eV+  F)  +  e"'(-r+  F)] , 

cos  X  [e'  (r  +  y)  -  e“"  (-r  +  F)] 

=sinA'[e'(A'  +  p)  +  e-'(-A'  +  p)],  (27a) 


and  from  (24), 

Ar^=  A{±4fl^|Fol"  +  p''- 
+l(±4a^|Fol^  +  p^  -  Vy  +  4(py)^]‘'^}  . 

(27b) 

In  terms  of  a  single  NLSE  soliton  may  be 
written  as  [6] 

a(x  t\  =  exp[-4i(^'-9")^-2i^Ac] 

cosh[2ri(x-Xo)  +  ^it]  ’ 

the  speed  of  which  is  seen  to  be  dx/dt=  -4^. 
Then  the  condition  for  production  of  solitons  at 
rest  is  found  by  letting  ^  =  0.  From  (26),  p  =  0  = 
r  and  (27)  is 

ATcot  A'=  -F, 

F^  =  ±(201^01)^^0,  (28) 

so  that  only  the  +  sign  may  be  considered.  This 
is  the  odd  case,  (15b)  and  (16),  of  the  KdV 
example.  As  noted,  there  is  a  production 
threshold  for  IFqI-  Eliminating  F  and  letting 
A'-oO,  we  find  F^-1,  =  (201^0!)^  -  1  ^0. 

The  quantity  governing  production  (i.e.,  the 
bound  state  spectrum)  in  (28)  is  2fl|V'ol-  Accord¬ 
ing  to  the  earlier  dimensional  speculation,  this 
corresponds  to  xq.  Substituting  the  scaling  (18a) 
into  (19)  yields 

b  =  2a  ,  c  =  —a  , 

x'q'  =  xq  .  (29) 

Interestingly,  the  modified  Korteweg-de 
Vries  (MKdV)  equation 

d,q(x,  t)  ±  bq^qx  +  =  0  (30) 

yields,  upon  employing  (18a), 
b  =  3a,  c  = -a^ x'q' =  xq  , 


(31) 
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i.e.,  the  x,  ^-invariance  as  the  NLSE.  An  analy¬ 
sis  along  previous  lines  of  the  associated  linear 
scattering  problem 

-t-  i{u,  =  qv^  , 

d^V2- +qv^  (32) 

leads  to  the  condition  (25)  already  obtained  for 
the  NLSE.  Of  course,  subsequent  time  evolution 
is  different  in  the  two  cases.  In  particular,  b  is 
different  in  (29)  and  (31). 

We  also  see  from  (29)  that  tq‘  =  t'q'^  holds. 
As  the  inverse  scattering  transform  contains  a 
pair  of  equations  describing  the  time  evolution  of 
the  scattering  functions  [4],  the  previous  discus¬ 
sion  suggests  that  a  determination  of  the  eigen¬ 
value  problem  using  the  temporal  rather  than  the 
spatial  scattering  problem,  would  show  the 
eigenvalue  spectrum  to  be  determined  by 
with  T  the  time  of  a  pulse  IV',,!  applied  to  the 
system  at  some  fixed  position  x.  Indeed,  a  con¬ 
sideration  of  the  forced  NLSE,  driven  by  a  time 
dependent  pulse  at  jt  =  0,  shows  this  to  be  the 
case  [6|.  (To  see  this  in  ref.  [6j,  one  must  note 
that  in  eq.  (3)  of  that  paper,  a  =  and  then 
consider  eqs.  (16)  and  (17),  along  with  the  inter¬ 
vening  discussion.) 

3.  Discussion 

For  the  KdV,  the  single  soliton  solution  shows 
that  the  product  of  width  times  the  square  root 
of  the  amplitude  is  constant.  We  have  seen  that 
the  heuristic  estimate  of  soliton  production  made 
by  considering  a  pulse  of  certain  width  and  am¬ 
plitude  as  an  integral  number  of  solitons  accord¬ 
ing  to  the  box  counting  procedure  given,  with 
any  excess  relegated  to  “radiation”,  holds  rigor¬ 
ously  in  the  KdV  example.  We  also  note  that  the 
powers  of  width  and  amplitude  determining  the 
soliton  spectrum  are  actually  those  obtained 
upon  demanding  invariance  under  a  rescaling  of 
the  KdV,  and  that  a  similar  result  occurs  for  the 


NLSE  and  MKdV  This  not  only  held  for  the 
initial  value  problem  (the  spatial  evolution  equa¬ 
tions  of  the  inverse  scattering  transform),  but 
also  for  the  “forced”  problem,  as  seen  by  consid¬ 
ering  the  time  evolution  equations  of  the  inverse 
scattering  transform  the  NLSE. 

We  established  eq.  (17)  with  use  of  the  inte¬ 
gral  of  the  single  KdV  soliton.  In  the  spirit  of  the 
WKB  we  generalize  the  results  to  speculate  that 

ldxq^(x,0)=CJV, 

with  A/  =  integer  number  of  solitons  produced,  C 
the  constant  determined  by  V  =  1 ,  and  F  the 
exponent  obtained  from  the  scale  invariance  of 
the  governing  equation.  A  similar  formula  can 
be  written  with  time  integration  for  the  “forced” 
problem.  As  such  a  relation  relates  the  soliton 
production  to  a  certain  integral  of  the  initial/ 
boundary  conditions,  our  speculation  is  that  the 
scaling  behavior  of  the  nonlinear  differential 
equation  could  perhaps  provide  a  useful  ap¬ 
proach  to  solving  the  problem  of  developing  an 
inverse  scattering  transform  for  forced  nonlinear 
equations  [8]. 
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We  discuss  the  advantages  of  parallel  computing  with  respect  to  both  computing  speed  and  coding  techniques.  As 
an  example  of  an  application,  we  show  results  of  dynamical  simulations  of  two  dimensional  Josephson  lattices  in 
external  magnetic  fields.  Using  data  parallel  computing,  we  are  able  to  study  the  dynamics  of  very  large  systems  and 
identify  the  interplay  between  spatial  and  temporal  scales  in  complex  systems. 


1.  Introduction 

Typically  nonlinear  problems  with  origin  in 
condensed  matter  physics  and  materials  science 
are  ideally  suited  for  implementation  on  mas¬ 
sively  parallel  computers.  This  is  of  course  true 
for  many  other  branches  of  science,  but  for  con¬ 
densed  matter  problems  the  reasons  are  espe¬ 
cially  clear.  First,  nonlinear  problems  in  higher 
dimensions  are  most  often  formulated  on  a  lat¬ 
tice,  the  lattice  for  example  represents  the  under¬ 
lying  discreteness  of  a  substrate  of  atoms  or  dis¬ 
crete  /c -values  in  Fourier  space.  The  fundamen¬ 
tal  unit  of  parallelism  thus  has  a  direct  analog 
in  the  physical  system  we  are  interested  in.  Sec¬ 
ondly,  we  are  often  interested  in  understanding 
the  influence  of  temperature  on  the  system,  i.e. 
the  system  is  in  contact  with  a  heat  bath.  There 
are  of  course  several  was  to  introduce  tempera¬ 
ture  in  a  problem,  but  often  it  is  most  convenient 
to  operate  in  the  canonical  ensemble,  i.e.  having 
the  temperature  constant  and  the  energy  fluctu¬ 
ate.  This  situation  is  also  closer  to  many  exper¬ 
imental  setups.  Third,  interactions  are  often  lo¬ 
cal,  i.e.  we  can  represent  the  important  physics 
with  a  nearest  or  next-nearest  neighbor  interac¬ 
tion.  Even  when  this  is  not  the  case  parallel  im¬ 
plementations  of  the  problem  is  often  possible 


which  nevertheless  maintain  a  local  communi¬ 
cation  topology  on  the  machine.  Fourth,  we  are 
often  interested  in  the  behavior  of  large  systems, 
as  close  to  experimental  situations  as  possible. 
This  naturally  leads  to  formulations  with  a  large 
number  of  lattice  sites,  especially  for  2D  and  3D 
problems.  Finally,  we  are  interested  in  the  dy¬ 
namic  response  of  the  system,  we  want  to  follow 
its  time  evolution.  All  of  these  factors  lead  to  a 
formulation  of  the  problem  in  terms  of  a  lai^e 
number  of  coupled  Langevin  ordinary  differen¬ 
tial  equations  with  near  neighbor  interaction. 

Fine  grained  massively  parallel  multicomput¬ 
ers  like  the  CM-2  and  CM-5  from  Thinking  Ma¬ 
chines  Corporation  are  ideally  suited  for  these 
types  of  problems.  We  will  illustrate  this  point 
in  the  following,  first  with  a  simple  example  of 
a  data-parallel  solution  to  Laplace’s  equation, 
secondly  with  an  example  of  data-parallel  tech¬ 
niques  applied  to  problems  from  the  dynamics 
of  Josephson  junction  arrays.  One  might  argue 
that  massively  parallel  computers  are  very  ex¬ 
pensive  and  only  accessible  to  a  few  fortunate 
researchers  and  therefor  not  of  general  interest. 
We  do  not  think  that  argument  holds.  Key  com¬ 
puter  industry  analysts  now  predict  that  by  the 
end  of  this  decade  the  majority  of  computers, 
from  the  PC  and  workstation  to  the  high-end 
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number-chruncher  will  be  multiprocessors  and 
parallel  processors.  Over  the  next  five  years,  it 
is  critical  that  computational  physicists  under¬ 
stand  this  revolutionary  technology  if  we  want  to 
make  progress  on  “Grand  Challenge”  problems 
in  nonlinear  science  and  elsewhere. 


2.  Data-parallel  programming 

In  this  section  we  will  briefly  illustrate  how 
programming  a  massively  parallel  computer  is 
surprisingly  simple  when  the  data-parallel  ap¬ 
proach  is  followed.  In  fact,  once  a  few  basic  con¬ 
cepts  are  understood  a  typical  code  is  signifi¬ 
cantly  simpler  that  a  corresponding  code  for  a 
serial  computer. 

The  Connection  Machine  2  (CM-2)  is  a  mas¬ 
sively  parallel  distributed  memory  computer 
containing  between  4096  and  65536  bit-serial 
processors.  Each  32  processors  share  a  floating¬ 
point  unit,  so  for  most  scientific  calculations 
a  65536  processor  CM-2  can  also  be  viewed 
as  a  2048  floating-point  processor.  The  CM-2 
is  a  SIMD  (single  instruction-multiple  data) 
computer,  i.  e.  all  the  processors  execute  the 
same  instruction  in  lock  step.  All  processors  are 
synchronized  at  all  times  and  problems  such  as 
dead-locks,  race-conditions,  etc.  can  not  occur. 
Each  processor  has  control  over  its  own  mem¬ 
ory,  -  up  to  128  Kbytes,  for  a  total  of  8  Gbytes 
on  a  64K  processor  machine.  For  communica¬ 
tions  purposes  the  processors  can  dynamically 
be  arranged  as  a  hypercube  shape  in  up  to  31 
dimensions.  For  most  applications  in  nonlinear 
condensed  matter  physics  the  dimension  of  the 
hypercube  is  low,  -  corresponding  to  2D  and  3D 
problems.  The  CM-5  is  a  newer  and  more  gen¬ 
eral  massively  parallel  multicomputer,  which 
in  addition  to  the  SIMD  data-parallel  program¬ 
ming  model,  also  supports  the  more  general 
MIMD  (multiple  instructions-multiple  data) 
programming  concept.  The  interested  reader  is 
referred  to  [  1  ]  for  more  details. 


The  data-parallel  computing  model  maps  very 
cleanly  on  to  the  CM-2  SIMD  structure.  Data 
parallel  computation  is  parallelism  through  the 
simultaneous  execution  of  the  same  instruction 
on  a  large  set  of  data.  This  should  be  contrasted 
with  control-parallelism  ,  where  speed-up  is 
achieved  through  the  simultaneous  execution  of 
different  instructions.  This  kind  of  parallelism 
is  much  harder  to  manage  and  best  results  are 
usually  achieved  at  the  hardware  level.  The 
data-parallel  concept  is  most  easily  illustrated 
with  an  example.  Given  three  N  x  N  {N  =  256 
e.  g.)  matrices  A,B,  and  C;  a  matrix  element 
from  each  matrix  is  associated  with  a  proces¬ 
sor  -  i.  e.  each  processor  has  allocated  memory 
for  three  matrix  elements  only.  The  statement 
C  =  A  +  B  is  a  single  statement  in  a  program 
and  will  be  executed  as  such  in  the  data-parallel 
computing  model.  One  each  processor  the  indi¬ 
vidual  matrix  element  calculation  is  performed 
simultaneously.  This  high-level  abstract  model 
is  important  because  it  makes  programs  less 
machine  dependent  and  thus  more  portable. 
The  data-parallel  model  is  supponed  in  the  lan¬ 
guages  C*  and  CM-Fortran  on  the  CM-2  and 
CM-5.  The  data-parallel  languages  also  support 
the  notion  of  virtual  processors.  Let  us  say  that 
N  in  the  above  example  was  512,  in  this  case 
=  262144  which  is  four  times  larger  than 
the  maximum  number  of  processors.  With  vir¬ 
tual  processors,  each  real  processors  is  now  per¬ 
forming  the  work  of  four  fictitious  processors. 
Of  course,  at  this  level  the  work  for  each  of  the 
four  virtual  processors  must  proceed  in  serial 
on  each  of  the  real  processors.  Nevertheless, 
this  model  of  computing  is  extremely  useful  and 
conceptually  simple.  Note  for  example,  that  a 
program  written  for  say  a  65K  processor  CM-2 
will  run  unchanged  on  a  16K  CM-2,  but  take 
four  times  as  long.  The  current  hardware  of  the 
CM-2  restricts  the  layout  of  the  processors  to 
have  axis  that  are  powers  of  two.  On  the  CM-5 
these  restrictions  do  not  apply. 

As  a  slightly  more  interesting  example,  con¬ 
sider  a  finite-difference  approximation  for  calcu- 
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slxap«  [512]  [512]grid: 

float :grld  a,  b; 

boolaanigrid  intarior.aloMat ; 

/•  Initialize  the  variables  to  soaething  ^propriate  */ 

where  ( inter ior.eleaent)  { 

b  ■  [.  ♦  l][.]a  +  [.  -  l][.]a  [.][.  ♦  l]a  ♦  [.][.  -  l]a 

-  4  ♦  [.][.]a: 

>  else  { 


/•  Do  soMthing  for  boundary  eleaents  •/ 


} 

Fig.  1.  C*  code  implementing  a  fmilr  difference  approximation  for  calculating  the  Laplacian  in  2D. 


lating  the  Laplacian  in  2D.  We  would  discretize 
space  such  that  aix,y)  =  aUAxJAy)  = 
and  an  approximation  of  the  Laplacian  correct 
to  2nd  order  in  Ax  and  Ay  would  be:  (a, +17  + 
Oi-ij  +  Oij+i  +  a,j_i  -  4aij)/Ax^Ay^.  The  es¬ 
sential  part  of  a  C*  program  implementing  this 
calculation  would  look  like  fig.  1. 

The  first  line  defines  a  processor  layout  with 
the  new  parallel  C  type  shape.  Here  we  have 
defined  a  S12  x  S12  2D  shape  called  grid. 
The  next  line  defines  two  floating-point  parallel 
variables,  a  and  b,  each  with  the  grid  shape. 
The  third  line  defines  a  similar  logical  variable, 
interior-element.  The  rest  of  the  code  show  a 
where-else  control  structure,  the  body  of  which 
calculates  the  actual  Laplacian.  The  left-index 
notation  [.][.+!] a,  is  another  C*  extension 
that  allows  addressing  of  neighboring  proces¬ 
sors.  The  “dot”  notation  should  be  read  as  “this 
processor’s  coordinate”.  This  code  is  executed 
on  all  processors  in  parallel  and  depending  on 
the  value  of  interior-element,  a  processor  will 
either  sit  idle  or  participate  in  the  calculation 
of  the  interior  Laplacian.  The  code  to  initialize 
interior-element  has  been  omitted.  The  else 
branch  of  the  control  structure  is  then  subse¬ 
quently  executed,  here  only  the  boundary  pro¬ 
cessors  are  active  and  all  others  sit  idle.  On 
the  CM-5  the  two  parts  of  the  control  structure 
could  execute  in  parallel. 

We  have  chosen  to  show  an  actual  piece  of 


code,  because  we  believe  that  it  shows  most 
convincingly  that  the  basic  concepts  of  data- 
parallel  programming  are  indeed  extremely  sim¬ 
ple.  In  comparison  with  a  conventional  serial 
program,  one  notes  the  complete  absence  of  any 
loops  over  the  lattice  sites.  The  Laplacian  is  cal¬ 
culated  with  one  statement  in  the  data-parallel 
code,  which  represents  very  cleanly  how  we 
think  about  the  problem.  Having  illustrated  the 
basic  principles  behind  data-parallel  computing 
we  now  turn  to  a  real  application  from  the  dy¬ 
namics  of  2D  Josephson  junction  arrays.  Inter¬ 
ested  readers  who  wants  to  know  more  about 
data-parallel  programming  are  referred  to  [2]. 


3.  Josephson  junction  arrays 

As  an  example  of  an  illustrative  system  mod¬ 
eled  by  data-parallel  computing  we  show  long¬ 
time  Langevin  dynamics  simulations  of  large 
{Nx  X  Ny,  ;Nx  =  Ny  =  128)  2-dimensional 
arrays  of  Josephson  junctions  in  a  uniformly 
frustrating  external  magnetic  field.  The  results 
demonstrate:  (i)  Relaxation  from  an  initially 
random  flux  configuration  as  a  “universal”  fit  to 
a  “glassy”  stretched-exponential  type  of  relax¬ 
ation  for  the  intermediate  temperatures,  and  an 
“activated  dynamic”  behavior  for  r  ~  (ii)  A 
glassy  (multi-time,  multi-length  scale)  voltage 
response  to  an  applied  current.  Intrinsic  dynam- 


P.S.  Lomdahl,  N.  Grenbech-Jensen  /  Langevin  dynamics  simulations  of  Josephson  junction  arrays 


21 


ical  symmetry  breaking  induced  by  boundaries 
as  nucleation  sites  for  flux  lattice  defects  gives 
rise  to  transverse  and  noisy  voltage  response. 

The  Lagrangian  for  the  JJA  takes  the  form  (see 
refs.  [3-5]  and  references  therein) 

C  =  —Eq  ^  [2  —  cosidjj  —  di-ij  —  Ai-ijjj) 
ij 

—  cos(0jj  —  6ij—\  —  )  ] 

+  {Co'£dfj-E^Y,eij(s^j-SN,j).  (1) 
iJ  iJ 

Here,  dij  is  the  phase  of  the  superconduct¬ 
ing  island  with  the  discrete  coordinates  (/  = 
j  =  1 , 2, . . . ,  )  of  the  lattice,  and 

Aij  ki  =  (lejhc)  A  ■  dl  is  the  integral  of  the 
vector  potential  from  island  (i,j)  to  a  neigh¬ 
boring  island  {k,l).  The  energy  associated  with 
each  Josephson  link  is  denoted  by  Eq,  and  the 
capacitance  between  each  superconducting  is¬ 
land  and  the  ground  plane  is  Cq.  An  external 
potential  Ei  is  imposed  at  the  edges,  Sjj  =  1 
for  i  =  j  and  <5, 7  =0  for  i  ^  j.  The  vector 
potential  A  is  given  in  the  Landau  gauge, 

Au,ki  =  (2) 

Xij  =  xfj  +  Sxij ,  yij  =  y^j  +  Syij. 

Here,  Sxij  and  Sy^  represent  a  spatial  disor¬ 
der  around  the  ordered  positions  {x^,yjj).  The 
model  of  the  Josephson  links  sketched  above 
lacks  two  potential  contributions.  One  is  the 
capacitive  coupling  between  the  superconduct¬ 
ing  islands.  This  interaction  introduces  implicit 
terms  in  to  the  equations  of  motion  and  compli¬ 
cates  the  computational  procedure  slightly,  but 
there  are  no  formal  problems  in  the  modeling 
as  can  be  learned  from  e.g.  ref.  [6].  We  have, 
however,  decided  to  study  the  system  assuming 
that  a  ground  plane  is  present,  since  this  case 
makes  the  capacitance  between  the  islands  in- 
signiflcant  when  compared  to  the  capacitance  to 
the  ground  plane.  The  other  contribution  to  the 
dynamics  is  the  long  range  inductive  interac¬ 
tion  between  the  currents  in  the  network.  This 


depends  on  the  magnetic  properties  around  the 
system  and  the  inductive  interaction  is  there¬ 
fore  not  easy  to  define  theoretically.  Secondly, 
this  type  of  interaction  poses  potential  problems 
for  a  numerical  scheme  since  the  dynamics  at 
a  given  point  depend  on  all  other  variables  in 
the  system.  Some  attempts  to  model  the  mu¬ 
tual  inductance  have  recently  been  published 
[7],  using  interaction  between  nearest  neighbor 
plaquettes  only.  However,  omitting  the  induc¬ 
tive  interaction  does  not  seem  to  prevent  the 
model  from  presenting  results  that  are  close  to 
experiments  (see  e.g.  ref.  [8]). 

The  /i/y  */  summed  around  a  plaquette  obeys 
the  following  relation:  Aijj^i  -y  Akijci-\  + 
Ak-\i-\,ij-\  +  Ajj-ijj  =  2nf,  where  the  frus¬ 
tration  /  =  //a^/d>o  is  a  constant  giving  the  av¬ 
erage  number  of  flux  quanta  d>o  =  hcl2e  of  the 
external  magnetic  field  H  through  the  area  of 
each  plaquette  of  the  array.  We  also  introduce 
the  fractional  charge  Qij,  obtained  as  the  gauge 
invariant  phase  sum  around  the  i j’th  plaquette: 

Qij  =  -  Aij-xjj)  mod  n 

+  {dij-i  -  mod  n 

-f-  (0,_1;-1  -  0,-17  -  /I, -17,, -17-1 )  mod  n 
+  (0,_  17  -  0,7  -  ^,7.,_  1 7 )  mod  71  ] .  ( 3 ) 

The  dynamical  equations  derived  from  eq.  ( 1 ) 
are  (in  normalized  units) 

0,7  =  sin  (0, -7+1  —  dij  —  Aijjj+i ) 

-l-  sin(0/7_i  —  dij  —  Aijjj^i) 

+  sin(0,-^.i7  —  dij  —  Aiji^ij) 

+  sin  (0,-17  —  dij  —  Aijj^ij) 

~  +0/7-1  +  0/+l7  +  0/-l7~‘^0/7) 

+  ?.ij(t)  +  JxiSij-dNj).  (4) 

Time  is  normalized  to  r  =  {Cofi/2eIo)^^^, 
where  Iq  =  (2c/^)£o  is  the  critical  current  of 
a  link,  and  the  normalized  external  bias  cur¬ 
rent  at  the  edge  points  is  J\  =  {2elhIo)Ei. 
The  normalized  dissipation  is  given  hy  tj  = 
(l//?)(ft/2cCo/o)‘^^  with  R  the  normal  resis¬ 
tance  of  the  junctions,  and  the  thermal  noise 
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associated  with  the  dissipation  is  introduced 
in  the  classical  Langevin  sense:  =  0, 

=  2tjTS(t  -  t'),  where  the  tem¬ 
perature  T  is  normalized  to  Eo/k^,  being  the 
Boltzmann  constant.  We  note  here  that  the  form 
of  eq.  (4)  lend  itself  to  a  very  natural  imple¬ 
mentation  in  data-parallel  C*,  like  the  example 
in  the  previous  section. 

(i)  Relaxation  studies  with  periodic  boundary 
conditions: 

For  this  part  of  the  study  we  have  chosen  the 
damping  parameter  to  be  /;  =  1,  and  of  course 
J\  =  0,  since  there  are  no  open  boundaries.  The 
periodic  boundary  conditions  are  implemented 
through  the  physical  observables  -  not  through 
the  phase,  dij.  For  any  choice  of  gauge  for  the 
external  magnetic  field  there  are  many  ways  of 
choosing  the  boundary  conditions  for  djj.  We 
have  made  the  choice  of  introducing  the  phases: 

0Oj  = 

+  =  S\j  +  ^\J,N,  +  IJ  -  Aoj^ij 

diO  =  GiN,, 

+  \  —  ^/1>  (5) 

which,  when  applied  to  the  system  defined 
above,  will  give  the  right  periodicity  of  the 
physical  system.  The  ground  state  for  the  maxi¬ 
mal  frustration  parameter  /  =  j  the  fractional 
charge  q  exhibits  a  checkerboard  pattern,  q  = 
±  ( - 1  .  Therefore  the  following  gauge  invari¬ 

ant  correlation  function  is  defined  as  a  measure 
of  the  magnetization  of  the  system  (for  /  =  5 ): 


X  51 QU  1  j  +  <ii- \j  +  <hj 1  +  <7/7- 1 )-  (6) 

ij 

This  quantity  is  C  =  - 1  for  the  /  =  j  ground 
state  and  C  =  0  for  the  random  flux  configu¬ 
ration.  The  initial  conditions  in  6ij  are  random 
and  we  have  followed  the  dynamics  in  normal¬ 
ized  times  up  to  10'*. 

We  have  shown  the  evolution  of  C(0  for  var¬ 
ious  temperatures  T  in  fig.  2.  It  is  important  to 


note  that  the  transition  temperature  for  /  = 
j  is  Tf  ~  0.45:  for  T  >  Tc  long-range  flux  or¬ 
der  and  superconductivity  are  lost.  In  fig.  2a  we 
see  how  the  large  temperature  prevents  the  sys¬ 
tem  from  reaching  its  ground  state  {T  =  O.S) 
and  how  a  lower  temperature  (T  =  0.3)  results 
in  a  final  state  very  close  to  the  ground  state, 
C  =  - 1 .  However,  we  also  observe  how  a  very 
small  temperature  (T  =  0.01)  may  result  in  a 
final  state  which  is  far  from  the  ground  state. 
This  behavior  of  the  final  state  as  a  function  of 
the  temperature  can  be  understood  as  trapping 
into  metastable  (after  the  initial  relaxation)  flux 
configuration  at  very  low  temperatures  because 
of  the  uniform  frustration.  A.S  the  temperature  is 
increased  the  thermal  tunneling  over  the  frustra¬ 
tion  barriers  is  allowed  and  C  approaches  closer 
to  the  ground  state  value.  For  further  increasing 
temperatures  the  thermal  energy  approaches  (or 
exceeds)  the  frustration  pinning  energy  and  ther¬ 
mal  randomization  occurs  (C  -♦  0  for  T  -♦00). 

In  fig.  2b  we  have  shown  the  quantity 
In  In  C(/)  as  a  function  of  In/.  For  the  interme¬ 
diate  temperatures  {0.3Tc  ^  T  O.lTc)  we 
find  an  excellent  fit  to  a  ‘glassy’  stretched  ex¬ 
ponential  type  of  relaxation  [10].  Specifically, 
we  find  the  dependence,  C(t)  ~  exp[-(//T)^], 
with  «  0.45,  in  good  agreement  with  studies 
of  other  multi  time-scale  systems  [10]. 

In  fig.  2c  we  have  shown  lnln|C(/)|  as 
a  function  of  Inin/  for  a  few  temperatures 
around  the  critical  value  Tc  =  0.45.  Here  crit¬ 
ical  effects  dominate  and  ‘activated  dynamics’ 
has  been  proposed  in  the  form  [10],  C(/)  ~ 
exp  [  -  ( In  (  //t  ))**].  As  is  seen  this  is  in  fair  agree¬ 
ment  for  T  K.  Tc,  where  we  find  the  exponent, 
d  «  0.9. 

Figure  3  displays  the  states  for  /  =  40000 
for  three  different  temperatures,  (a)  T  =  0.01, 
(b)  T  =  0.25,  and  (c)  T  =  0.45.  We  have 
shown  (left)  the  staggered  order  parameter, 
qij  =  (-!)'■'■-',  since  this  quantity  will  be  uni¬ 
form  as  a  function  of  i  and  j  if  the  system  is 
in  a  ground  state  /  =  j.  We  have  also  shown 
(right)  the  spatial  Fourier  transforms,  g(k),  of 
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Fig.  2.  The  global  correlation  function  C(t)  during  relaxation  from  random  initial  condition  for  /  =  j  and  zero  disorder, 
Sxij  =  Syij  =  0. 


the  shown  qij  distributions.  This  function  will 
be  sensitive  to  the  distribution  of  domain  sizes 
rather  than  the  detailed  structure  of  domain 
walls.  Comparing  the  structures  of  the  different 
temperatures  we  see  that  the  intermediate  tem¬ 
perature  creates  a  system  which  is  dominated 
by  only  a  few  k  scales,  when  compared  to  the 
system  in  very  small  (a)  or  in  large  tempera¬ 
ture  (c).  The  spatial  structures  of  the  states  also 
show  some  qualitative  differences  between  the 
very  cool  and  the  warm  system.  Looking  at  the 
spectral  plots  we  find  that  the  spectral  power 
centers  around  the  horizontal  and  vertical  axes, 
whereas  for  the  warm  system  the  distribution 
is  more  independent  of  the  angular  orientation. 


This  is  due  to  the  thermally  nucleated  vortex 
antivortex  pairs  visible  in  fig.  3c  (left). 

Figure  4  shows  how  a  thermally  nucleated 
vortex-antivortex  pair  may  be 
annihilated  by  a  domain  wall.  For  three  differ¬ 
ent  times  we  show  the  same  part  of  the  system. 
The  first  shows  the  domain  wall  and  the  nucle- 
ation  in  its  close  vicinity.  The  second  shows  the 
wall  getting  contact  with  the  nucleation,  and  the 
third  shows  the  total  annihilation.  The  normal¬ 
ized  time  span  from  fig.  4a  to  3c  is  «  2. 

(ii)  Externally  driven  dynamics  with  open 
boundary  conditions;  In  this  case  the  boundary 
conditions  are  described  by  the  phases. 


&0j  =  ^Ij  + 
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Fig.  3.  The  qij  configuration  (left)  for  the  relaxed  states 
(t  =  10^)  at  different  temperatures  and  the  spectral  power 
(right)  |^(*)|.  Qij  >  0  -  black;  q,j  <0  -  white.  The  color 
code  for  the  spectral  power  is  log,o ( |g (* ) |^ )  >  1 0^  -  black; 
10*  <  log,o(|g(*)p)  <  10*  -  grey;  log,o(|g(*)|^)  <  10*  - 
white,  (a)  T  =  0.01;  (b)  T  =  0.25;  (c)  T  =  0.45. 


(a)  (b)  (C) 


i  i  i  i  M  i 


Fig.  4.  Annihilation  of  a  thermally  nucleated  (T  =  0.25) 
vortex-antivortex  pair  on  a  domain  boundary.  The  plots 
show  1 6  X  1 6  of  the  total  1 28  x  1 28  system  for  three  different 
times  (/»</(,<  -  tc  w  2).  The  qij  configuration  is 

shown  as  in  fig.  2  (left). 


+  =  ^N,j  +  ^N:,J.N^+ijy 

^iO  —  ^/l  + 

0i^,  + 1  =  ^i/i/,  +  A  + 1 .  (  7  ) 

These  boundary  conditions  will  result  in  zero 
current  flow  through  the  edges  -  except  the  cur¬ 
rent  supplied  by  the  external  source,  J\ .  We  have 
studied  the  system  as  deflned  above  for  the  dis¬ 
sipation  parameter  tj  =  5  and  the  external  mag¬ 
netic  field  /  =  j.  In  fig.  5  we  show  the  current- 
voltage  (IV)  characteristics  of  the  system.  The 
current  is  supplied  at  the  vertical  edges  of  the 
system  and  the  resulting  dc  voltages,  Vx  and  Vy, 
are  measured  and  displayed  in  fig.  Sa  and  fig. 
5b,  respectively.  The  voltages  are  defined  in  the 
following  way. 


Vx 


N. 


(8) 


At  the  macroscopic  level  of  the  J\-Vx  curve  we 
see  no  essential  differences  from  the  periodic 
boundary  conditions  [3].  Examination  of  the 
dynamics  reveals  the  presence  of  a  window  of 
noisy  response  for  Jc(f  =  j)  ^  J\  ;S  0.55. 
This  multi  time-scale  response  is  understood 
in  terms  of  irregular  domain  wall  separations, 
shown  in  the  insets  (7]  =  0.4)  in  fig.  5a,  where 
the  Qij  distribution  is  displayed.  Figure  5b  shows 
that  a  large  transversal  voltage  is  uniquely  asso¬ 
ciated  with  the  ‘chaotic’  regime  of  the  dynam¬ 
ics.  Indeed,  we  see  that  the  amplitude  of  Vy  is 
largest  in  the  most  noisy  regime  (lowest  J\ )  and 
vanishes  in  the  periodic  regime.  The  origin  of 
the  transversal  voltage  generation  can  be  under¬ 
stood  from  the  insets  in  fig.  5.  In  fig.  5b  the  solid 
curves  show  the  time  evolution  of  the  voltage, 

2 


and  the  dotted  curves  show  the  time  evolution 
of  the  voltage. 
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Fig.  S.  The  voltages  across  the  JJA  as  a  function  of  the  applied  bias  current,  (a)  The  longitudinal  dc  voltage.  The  insets  show 
the  Qij  configuration  (color  code  as  in  fig.  2)  for  two  characteristically  different  cases.  The  bias  current  is  forced  through 
the  system  along  the  horizontal  axis,  (b)  The  amplitude  of  the  transversal  voltage.  Insets:  time  evolution  of  the  transversal 
voltages,  Vy  (dashed),  (solid),  and  (dotted). 


2 

=  w  E  (10) 

i  =  N,/2+\ 

and  the  dashed  curves  show  the  sum,  Vy.  In  the 
periodic  regime  we  find  large  oscillations  for 
Vy‘^  and  Vy'^  (see  fig.  5b  for  Ji  =  0.7),  but 
since  they  are  of  equal  size  and  opposite  sign, 
the  total  transversal  voltage  is  zero.  For  the 
chaotic  regime,  on  the  other  hand,  the  dynami¬ 
cally  induced  spatially  disorder  (see  fig.  Sa  for 
Ji  =  0.4)  makes  the  two  contributions  unequal 
in  size,  and  hence  the  sum,  Vy,  starts  to  oscil¬ 
late  as  shown  in  fig.  5b  (inset  for  Ji  =  0.4). 
Furthermore,  as  suggested  in  the  inset  of  fig.  5a, 
and  as  we  have  observed  through  careful  exam¬ 
ination  of  the  time  evolution,  the  asymmetry 
between  the  left  and  right  half  planes  develops 
by  local  flux  defects  nucleating  at  the  bound¬ 
aries  and  propagating  into  the  interior.  It  is  here 
important  to  note  that  the  dynamically  induced 
transversal  voltage  response  to  the  longitudinal 
dc  current  is  purely  oscillating  and  has  no  dc 
value.  Detailed  studies  [5]  of  the  power  spectra 
of  the  longitudinal  and  transversal  voltages  show 
that  the  frequencies  of  Vx  and  Vy  are  closely  re¬ 
lated.  Not  surprisingly,  the  Josephson  frequency 
is  present  in  the  signals,  but  also  a  large  fre¬ 


quency  contribution  arises  from  the  injection  of 
domain  walls  at  the  driven  edges.  Further,  the 
breaking  of  the  symmetry  in  the  chaotic  regime 
results  in  an  exact  half  harmonic  generation, 
since  each  half  of  the  system  now  contributes 
separately.  Let  us  conclude  this  section  by  not¬ 
ing  that  the  symmetry  breaking  and  transverse 
voltage  generation  has  been  observed  for  a  dis¬ 
ordered  system  as  well  [5]  and  the  results  of  the 
dynamically  induced  disorder  and  the  positional 
disorder  are  qualitatively  very  similar. 


4.  Conclusion 

We  have  used  the  data-parallel  programming 
model  to  study  the  dynamics  of  large  frustrated 
arrays  of  Josephson  junctions  on  the  CM-2.  This 
approach  allow  for  a  flexible  and  intuitive  im¬ 
plementation  of  the  basic  dynamical  equations. 
Our  study  reveals  an  interesting  relation  between 
spatial  and  temporal  scales,  including  relaxation 
from  random  flux  configurations  which  exhibit 
a  stretched-exponential  behavior.  The  dynamics 
of  the  noisy  response  is  controlled  by  “defects” 
structures  -  vortices  and  domain  walls,  which  are 
defined  with  respect  to  the  underlying  ground- 
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State  flux  structure.  The  Josephson  junction  ar¬ 
rays  are  ideal  laboratory  scale  systems  for  study¬ 
ing  the  complexity  of  spatio-temporal  nonlinear 
phenomena.  Using  data-parallel  computing  on 
the  new  massively  parallel  computers  we  are  now 
in  a  position  to  model  these  systems  in  a  one-to- 
one  correspondence. 

We  thank  the  Los  Alamos  Advanced  Com¬ 
puting  Laboratory  for  generous  support  and  for 
making  their  facilities  available  to  us.  This  work 
was  performed  under  the  auspices  of  the  US  De¬ 
partment  of  Energy. 
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Non-linear  properties  of  Josephson  junctions 

N.F.  Pedersen 

Physics  Laboratory  /,  The  Technical  University  of  Denmark,  DK-2800  Lyngby,  Denmark 


The  non-linear  properties  of  Josephson  junctions  are  reviewed.  Those  properties  include  ihc  possibility  to  have  soliton 
propagation  in  a  Josephson  junction  transmission  line,  i.e.,  a  very  ordered  state.  The  non-lineai  properties  also  include  the 
possibility  for  chaos  in  many  different  Josephson  junction  systems,  i.e.,  a  very  non-ordered  state.  Several  examples  from 
experiments  are  given. 


1.  Introduction 

The  Josephson  junction  is  a  unique  non-linear 
system  for  several  reasons.  Firstly  one  may  find 
most  of  the  generic  non-linear  behaviour  in  vari¬ 
ous  Josephson  junction  systems.  For  example 
chaos  transitions  of  several  different  kinds  may 
be  found,  including  the  famous  period  doubling 
route  to  chaos.  Another  non-linear  phenomenon 
such  as  soliton  propagation  may  also  be  found  in 
a  spatially  extended  Josephson  junction,  which  is 
described  by  the  socalled  perturbed  sine-Gordon 
equation  [1].  Also  other  typical  non-linear  phe¬ 
nomena  such  as  phase-locking,  subharmonic  gen¬ 
eration,  phase  transitions  etc.  may  be  found  in 
Josephson  junction  systems. 

Secondly  a  most  important  aspect  of  the  non¬ 
linear  behaviour  of  Josephson  junctions  is  that 
most  of  the  above  mentioned  phenomena  are 
accessible  for  experimental  measurements.  This 
has  given  a  unique  |X)ssibility  of  cooperation 
between  researchers  working  with  computer 
simulations  on  one  side  and  researchers  doing 
experimental  work  on  the  other.  The  benefit  of 
such  crossfertilisation  has  produced  results  that 
through  the  mutual  interaction  by  far  exceed  the 
sum  of  the  individual  contributions.  I  hope  to 
convincingly  demonstrate  this  unique  feature  of 


Josephson  junction  physics  in  the  examples  pre¬ 
sented  on  the  following  pages.  The  paper  is 
organized  in  the  following  way.  In  section  2  the 
autonomous  Josephson  is  described.  In  section  3 
we  discuss  soliton  propagation  with  one  subsec¬ 
tion  describing  the  theory  and  numerical  simula¬ 
tions  and  the  other  describing  experiments. 
Chaos  is  discussed  in  section  4  with  a  similar 
subdivision  between  simulations  and  experi¬ 
ments.  Conclusions  are  found  in  section  S. 


2.  The  autonomous  Josephson  junction 

The  most  widely  investigated  Josephson  junc¬ 
tion  system  is  the  current-driven  Josephson  junc¬ 
tion,  a  description  of  which  may  be  found  in 
recent  books  on  the  subject  [2,3].  The  Josephson 
junction  consists  of  two  superconducting  elec¬ 
trodes  separated  by  a  tunneling  barrier.  The 
equations  with  a  bias  current  may  be  written 


CdV  V  ,  . 

+  /o  sin  <p  =  /de  . 

d^  ^  2eV 
At~  h  ' 


(la) 

(lb) 
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Fig.  I.  Equivalent  diagram  (a)  and  schematic  drawing  of  a 
tong  Josephson  junction. 


Equation  (lb)  is  the  famous  Josephson  fre- 
quency-to-voltage  relation  and  eq.  (la)  is  Kir- 
choffs  law  applied  to  the  Josephson  junction 
equivalent  circuit,  a  section  as  shown  in  hg.  la. 
Here  <p  is  the  pair  phase  difference  across  the 
junction,  and  /„  is  the  maximum  pair  current.  In 
addition,  a  shunt  resistance  R  carries  a  normal 
electron  current  (V/R),  a  capacitance  C  carries  a 
capacitive  current  C(dV/dt),  h  is  Planck’s  con¬ 
stant  and  e  is  the  electron  charge.  With  time 
normalized  to  the  reciprocal  plasma  frequency 
o}^' =  {hCI2eI„y''  and  current  normalized  to 
the  critical  current  /(,,  these  equations  may  be 
combined  into  a  single  dimensionless  equation 
(the  pendulum  equation), 

<Pt,  +  a<P,  +  sin  =  Tj .  (2) 

The  plasma  frequency,  w„,  is  the  natural  oscilla¬ 
tion  frequency  for  the  Josephson  junction,  corre¬ 
sponding  to  the  oscillation  frequency  in  a  pen¬ 
dulum.  The  McCumber  parameter  is  given  by 
/Sf  =  2eR^I„Clh,  and  the  damping  parameter  a  is 
given  by  (1/V^).  tj  is  the  normalized  (to  4)  dc 
bias  current. 


3.  Soliton  propagation 

3.1.  Long  Josephson  junctions:  spatial 
dependence 

As  an  extension  of  the  simple  Josephson  junc¬ 
tion  let  us  now  consider  what  happens  when  a 
spatial  variation  of  the  pair  phase  is  allowed.  We 
may  get  a  soliton.  Indeed,  the  long  Josephson 
junction  is  one  of  the  physical  systems  where 


soliton  propagation  is  accessible  for  direct  ex¬ 
perimental  measurements.  For  the  purpose  of 
this  presentation  it  is  sufficient  to  note  that  the 
physical  manifestation  of  the  soliton  is  a  fluxon, 
i.e.,  a  quantum  of  magnetic  flux  = 
Ji/2e  =  2.064  X  lO”*' Vs.  Moving  fluxons  in  the 
long  Josephson  junction  manifest  themselves  as 
the  so-called  zero  field  steps  (ZFS)  in  the  dc 
current-voltage  characteristic.  Figure  la  shows 
the  equivalent  diagram  and  fig.  lb  the  geometry 
of  a  long  junction  of  the  overlap  type.  The 
corresponding  wave  equation  may  be  written  as 
a  rather  simple  extension  of  eq.  (2); 

~^xx  +  ‘P,i  +  sin<p  =  T)-a(p,  ,  (3) 

with  (normalized)  voltage  V  =  tp,  (as  in  eq.  (lb)) 
and  (normalized)  current  /  =  -<p^.  The  addition¬ 
al  normalizations  used  are  as  follows.  Length  is 
measured  in  units  of  the  Josephson  penetration 
depth  Aj  =  \J{hl2dep^^J),  i.e.,  /  =  L/Aj.  J  is  the 
current  density,  and  d  is  the  magnetic  thickness 
of  the  junction.  The  junction  length,  L,  is  as¬ 
sumed  large,  and  the  width,  W,  is  assumed  small 
compared  to  the  Josephson  penetration  depth. 
Finally,  velocities  become  normalized  to  the  ve¬ 
locity  of  light  in  the  barrier,  c.  For  typical  ex¬ 
perimental  junctions  (of  “old”  superconductors), 
c  is  a  few  percent  of  its  value  in  vacuum. 

3.1.1.  Perturbation  calculation  for  the  infinite 
Josephson  line 

The  methods  in  this  section  are  based  mainly 
on  the  work  of  McLaughlin  and  Scott  [1].  With 
the  right  hand  side  equal  to  zero,  eq.  (3)  is  the 
sine-Gordon  equation.  The  loss  and  bias  terms 
on  the  right  hand  side  are  considered  as  a  pertur¬ 
bation  to  the  sine-Gordon  equation.  The  un¬ 
perturbed  sine-Gordon  equation  has  the  well 
known  analytical  single  soliton  solution  [1,4], 

(p  =  4tan“'e*,  (4) 

where  6  =  (x  -  ut)y(u),  and  y(u)  =  l/yjl  -  u^) 
is  the  Lorentz  factor.  The  solution  gives  rise  to  a 
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Itt  phase  shift  over  a  length  of  a  few  Aj,  and  its 
derivative  tp,  represents  a  voltage  pulse.  Note 
that  the  form  of  the  solution  -  a  traveling  wave 
in  the  parameter  9  =  y(u)  (x  -  ut)  -  is  a  con¬ 
sequence  of  the  Lorentz  invariance  of  the  sine- 
Gordon  equation.  In  eq.  (4)  the  velocity  u  is  a 
free  parameter.  As  shown  in  refs.  [1,4]  the  per¬ 
turbation  terms  are  included  by  assuming  a  solu¬ 
tion  of  the  same  form  as  that  in  eq.  (4),  but  with 
u  to  be  determined  by  a  power  balance  equation. 
Requiring  either  the  Hamiltonian  or  the  momen¬ 
tum  to  be  independent  of  time,  one  finds  the 
velocity  to  be  determined  by  [1,4] 


A  more  complicated  solution  to  the  sine- 
Gordon  equation  that  may  be  perturbed  under 
the  influence  of  bias  and  losses  is  the  soliton  - 
antisoliton  solution  that  may  be  written  [1] 

(p  =  4tan'‘(sinh  r/u  cosh  A") ,  (6) 

where  T  =  uy{u)  t  and  X  =  yiu)  x.  It  is  possible 
in  a  manner  similar  to  that  for  the  single  soliton 
case  to  perform  a  power  balance  calculation  by 
requiring  the  time  rate  of  change  of  the  energy  H 
to  be  zero,  i.e.,  calculating  the  integral  [4-6] 

irpp,-a(p^;)dx,  (7) 

with  <p  inserted  from  eq.  (6).  For  this  case  a 
qualitatively  new  phenomenon  occurs.  For  high 
incident  energies  the  soliton  and  antisoliton  will 
pass  through  each  other  with  a  phaseshift  8 
(spatial  advance)  given  by 

5  =  -2(1  -  M')  In  u  .  (8) 

For  bias  below  a  certain  threshold,  to  be 
calculated  below,  the  soliton  and  antisoliton  will 
annihilate  each  other,  create  a  breather  mode, 
and  eventually  die  out  as  small  amplitude 
damped  plasma  oscillations.  In  evaluating  eq.  (7) 


Fig.  2.  Fluxon-antifluxon  collision,  a  =  0.2.  p  =  0.22.  /  =  40. 

one  finds  after  rather  lengthy  calculations  [6]  that 
the  collision  gives  rise  to  an  energy  loss  A//, 
given  by  [6] 

(9) 

Part  of  this  energy  is  dissipated  in  propagating, 
but  decaying,  oscillations  of  the  line.  The  annihi¬ 
lation  threshold  [6]  may  be  found  by  requiring 
that  the  total  energy  of  the  soliton  and  anti¬ 
soliton  before  the  collision,  H  =  16y(M),  is  equal 
to  the  energy  loss  plus  the  rest  energy  of  a 
stationary  soliton  and  antisoliton,  i.e., 

16y(M)  =  4TT'a  -I-  16y(0) .  ( 10) 

Equation  (10)  together  with  eq.  (5)  leads  to  [6] 

T?,H«(2a)"^  (11) 

Figure  2  shows  a  numerical  calculation  of  a 
soliton  -  antisoliton  collision  where  both  of  the 
above  phenomena  -  the  energy  loss  and  the 
phase  shift  -  are  easily  observed. 

3.1.2.  The  overlap  and  annular  Josephson  trans¬ 
mission  line 

In  the  overlap  junction  (fig.  lb)  the  long  di¬ 
mension  of  the  junction  is  perpendicular  to  the 
current  direction  and  the  bias  current  is  uniform¬ 
ly  distributed  over  the  junction  length.  Hence 
rf  =  IJJWL  may  be  assumed  in  eq.  (3).  Due  to 
the  moving  fluxon,  a  phase  shift  of  2tt  takes 
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place  in  a  time  interval  l/u,  where  /  is  the 
(normalized)  length  of  the  junction.  This,  in 
turn,  gives  rise  to  a  dc  voltage  v,  given  by 

y  =  (2ir//)M.  (12) 

The  overlap  junction  has  boundary  conditions 
requiring  that  no  currents  flow  out  at  the  ends, 
i.e., 

<Px(0,0  =  <P;.(/,0  =  0.  (13) 

It  may  be  shown  that  this  boundary  condition 
is  mathematically  equivalent  to  a  soliton  - 
antisoliton  collision,  which  was  treated  in  the 
previous  section. 

The  annular  geometry  on  the  other  hand  looks 
mostly  like  a  long  overlap  Josephson  junction 
that  is  folded  back  into  itself;  it  has  the  simple 
periodic  boundary  conditions 

‘Pa-(0*  0  =  <Px(^  0  +  2pTT ,  (14) 

which  for  its  simplicity  is  favoured  by  many 
authors.  Here  p  gives  the  number  of  full  phase 
rotations  along  the  line.  For  topological  reasons 
p  is  a  conserved  number  as  long  as  the  supercon¬ 
ducting  ring  is  not  opened  up  by  taking  the 
junction  through  the  transition  temperature  [7- 
10],  or  by  local  hearing  with  an  e-beam  [11,12]. 

The  simplest  case  to  consider  is  p  =  1,  i.e.  a 
single  fluxon  on  tbe  circular  line.  This  case  is 
shown  in  flg.  3a  for  the  equivalent  closed  pen¬ 
dulum  array  [9].  In  that  case  there  is  no  super¬ 
current,  since  as  soon  as  a  uniform  bias  current  is 
applied,  the  fluxon  starts  moving  with  a  velocity 
u  determined  by  eq.  (5)  and  a  voltage  given  by 
eq.  (12)  develops.  Hence  for  the  annular  junc¬ 
tion  with  one  trapped  fluxon  no  supercurrent 
exists  and  the  dc  voltage  is  a  direct  measurement 
of  the  velocity  of  the  single  soliton.  The  IV  curve 
is  shown  qualitatively  in  fig.  3b.  In  addition  to 
the  single  soliton,  further  solitons  may  be 
created  only  by  introducing  soliton  -  antisoliton 
pairs  (for  topological  reasons),  in  which  case  the 


Fig.  3.  (a)  Equivalent  pendulum  array  for  the  annular 
Josephson  with  one  fluxon  (from  ref.  [9]).  (b)  Schematic  IV 
curves:  Annular  junction  p  =  1. 


effects  of  collisions  must  be  taken  into  account. 
Disregarding  the  collisions  for  simplicity,  the 
voltages  of  these  different  ZFS  are  given  by 
multiplying  the  one  fluxon  voltage  with  the  total 
number  n  of  fluxons  and  antifluxons,  i.e.,  voltage 
steps  are  to  be  exfjected  at  voltages  y,,  Uj  =  3u,, 
U5  =  5y,,  etc.  (shown  as  the  dashed  curves  in  fig. 
3b).  For  the  higher  order  branches,  v^,  v^, . . . 
the  qualitative  effect  of  the  collisions  is  to  lower 
the  average  velocity  and  voltage  somewhat  com¬ 
pared  to  /lu,,  where  /i  =  3,  5,  7, . . .  .  Also  a 
lower  bias  threshold  17,^,  where  a  fluxon  and  an 
antifluxon  annihilate  each  other  (cf.  eq.  (11)),  is 
to  be  expected.  Figure  3b  shows  qualitatively  the 
higher  order  steps  (full  curves)  based  on  these 
arguments.  Reference  [8]  shows  for  p  =  2  a  nu¬ 
merically  simulated  IV  curve  which  contains  all 
the  essential  features  of  flg.  3b. 

3.2.  Soliton  experiments:  dc  IV-curves 

In  the  IV  curve  of  an  overlap  junction  the 
moving  soliton  gives  rise  to  zero  field  steps 
(ZFS).  The  mechanism  for  the  first  ZFS,  n  =  \, 
is  that  a  fluxon  moves  along  the  junction  and  is 
reflected  at  the  boundary  as  an  antifluxon.  Since 
the  reflection  at  x  =  /  is  equivalent  to  a  collision 
with  a  virtual  antifluxon  at  x  =  I,  the  problem 
may  be  treated  in  the  framework  of  eq.  (8)  for 
the  phase  shift  and  eq.  (9)  for  the  energy  loss.  If 
the  junction  length  /  is  very  large,  the  details  at 
the  boundaries  play  only  a  minor  role,  and  the 
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Fig.  4.  (a)  Experimental  zero  field  steps  for  an  overlap 
junction.  (After  ref.  (5|).  (b)  Experimental  n-fluxon  zero 
field  steps  on  an  annular  junction,  n  =  1-9  (after  refs. 
[11,121). 

voltage  of  the  first  step  is  given  by  eq.  (12).  For 
example,  Pedersen  and  Weiner  [5]  were  able  to 
neglect  completely  ^he  effects  of  collisions  in  a 
comparison  between  experimental  ZFS  on  a  very 
long  overlap  junction  (/  =  45)  and  perturbation 
theory.  Figure  4a  shows  an  example  of  one  of 
their  experimental  curves.  If  the  junction  length 
is  smaller  (for  example  of  the  order  5-10),  the 
energy  loss  and  the  phase  shift  will  give  rise  to 
corrections. 

Experimental  measurements  of  solitons  on  the 
annular  junction  [9,10]  have  also  been  reported. 
In  the  experiment  the  p  value  could  be  changed 
only  by  taking  the  junction  through  the  transi¬ 
tion  temperature,  p  =  1  appeared  qualitatively  as 
discussed  above,  p  =  0  (zero  fluxons  trapped) 
showed  the  full  supercurrent  and  fluxon  -  anti- 
fluxon  steps  at  voltages  V2^2v^,  i;4=4i;,... 
etc. 

In  a  more  recent  experiment  [11,12]  the  intro¬ 
duction  of  fluxons  could  be  made  in  a  much 
more  controlled  way.  Here  an  electron  beam  was 
used  to  open  one  side  of  the  annular  junction 
and  introduce  a  fluxon.  Each  time  the  electron 
beam  was  crossing  the  junction  a  new  fluxon  was 
introduced.  Passing  the  beam  in  the  opposite 
direction  removed  a  fluxon.  Figure  4b  shows  an 
IV  curve  from  this  experiment  [11,12],  in  which 
successively  one  up  to  nine  fluxons  were  intro¬ 
duced.  The  annular  junction  experiment  dem¬ 
onstrates  in  a  very  clean  way  the  existence  of  a 
topological  sine-Gordon  soliton  with  a  phase 


change  of  2it.  The  experiment  and  interpretation 
are  elegantly  connected  with  fundamental  theory 
which  requires  the  phase  to  change  only  in  miuti- 
ples  of  2-17  around  a  superconducting  ring. 

4.  Chaos 

4.1.  Chaos  in  Josephson  junctions 

Another  non-linear  signature  appearing  quite 
often  in  Josephson  junctions  is  chaos  together 
with  its  accompanying  bifurcations.  In  fact,  quite 
often  the  Josephson  junction  is  used  as  a  model 
system  for  chaos  in  numerical  simulations.  A 
particular  feature  of  chaos  in  Josephson  junc¬ 
tions  is  that  both  the  effect  of  thermal  noise  and 
the  effect  of  deterministic  noise  (chaos)  are  very 
important  in  relation  to  the  experiments.  The 
interplay  between  those  two  sources  of  noise  is 
very  complicated,  and  at  worst  makes  it  difficult 
to  interpret  experiments.  This  has  led  to  new 
theoretical  and  numerical  work  on  the  non-linear 
interaction  between  thermal  and  deterministic 
noise. 

4  1.1.  Deterministic  chaos  in  the  Josephson 
junction 

The  most  widely  investigated  chaotic  Joseph¬ 
son  junction  system  is  the  rf-driven  Josephson 
junction  for  which  the  equation  may  be  obtained 
by  adding  a  term  /,f  sin  tor  to  eq.  (la),  i.e.,  in 
normalized  units  (/2  =  utlto^) 

<p„  +  atp,  +  sin  <p  =  Tjo  -t-  tj,  sin  Qt .  (15) 

Since  analytical  solutions  do  not  exist,  one  has 
to  do  numerical  simulations  [13-18]  in  the  pa¬ 
rameter  space  of  a,  t)„,  tj,  ,  and  Ji.  A  particularly 
thorough  investigation  of  the  parameter  space 
was  done  in  ref.  [18].  Typically,  the  system  has 
been  investigated  numerically  in  the  77,-12  plane 
for  a  fixed  damping  parameter  a,  a  plot  which 
more  or  less  has  become  a  standard  for  such 
systems.  Such  a  parameter  plane  shows  a  very 


32 


N.  F.  Pedersen  /  Nonlinear  properties  of  Josephson  junctions 


complicated  mapping  of  different  dynamical  be¬ 
haviour  [14|.  For  oi  >  ojo,  i.e.,  f2>l,  the 
threshold  rises  because  the  capacitor  shorts  out 
the  applied  rf  current.  For  &)<l/f?C,  i.e., 
17  <  1  /  V^,  the  system  is  able  to  follow  adiabati- 
cally  the  rf  current,  and  chaos  occurs  only  if 
Tj,  >1.  For  (o  ~  coq,  the  threshold  for  chaos  is 
lowest. 

Besides  numerical  simulation  another  method 
of  a  somewhat  computational  nature  is  to  use  an 
electronic  analog  simulating  the  Josephson  equa¬ 
tion  [19].  Such  systems  have  the  advantage  of 
being  very  fast,  and  Poincare  sections  and  bifur¬ 
cation  diagrams  may  be  readily  displayed.  The 
disadvantage  is  the  limited  precision  and  res¬ 
olution,  and  the  drift  of  analog  electronic  cir¬ 
cuits. 

4.1.2.  Thermally  affected  chaos  in  the  Josephson 
junction 

In  ref.  [20]  a  Josephson  junction  system  with 
parameters  such  that  two  solutions  existed,  was 
investigated.  The  authors  found  that  the  basin 
boundaries  between  the  two  solutions  were  frac¬ 
tal.  A  small  amount  of  thermal  noise  may  then 
take  the  system  back  and  fourth  between  the 
solutions.  The  authors  found  that  this  mecha¬ 
nism  gave  rise  to  approximately  1//  noise  for 
some  parameter  regions. 

In  an  extensive  numerical  simulations  [21]  in¬ 
cluding  a  thermal  noise  term  in  eq.  (2),  Kautz 
was  able  to  obtain  the  very  high  noise  tempera¬ 
tures  (-10*^  K)  that  have  been  observed  ex¬ 
perimentally.  For  a  situation  with  overlapping 
rf-induced  steps,  the  origin  of  the  very  high  noise 
temperatures  was  hopping  between  phase  locked 
and  metastable  chaotic  states  induced  by  thermal 
fluctuations.  This  may  even  lead  to  the  result 
that  the  low  frequency  noise  power  increases  as 
the  temperature  is  reduced.  In  the  absence  of 
thermal  noise,  numerical  calculations  of  chaotic 
regions  in  the  IV  curve  typically  contain  a  wealth 
of  complicated  structure  displaying  bifurcations, 
chaos,  periodic  solutions,  etc.  This  may  be  seen 
in  fig.  5,  which  shows  a  numerical  calculation  of 
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Fig.  5.  Numerical  calculation  of  phase  locking  steps  of  an  IV 
curve.  /i  =  0.16,  a  =0.5,  rj,  =  1.05.  After  ref.  [22]. 


an  rf-induced  step  with  loss  of  phase  lock.  In 
experiments  that  have  been  performed,  such  in¬ 
teresting  and  complicated  structure  have  been 
typically  washed  out  because  of  thermal  noise, 
and  only  a  smooth  curve  which  does  not  in  a 
simple  and  convincing  way  demonstrate  chaos,  is 
obtained  [22].  Thermal  smoothing  due  to  a  tem¬ 
perature  of  less  than  100  mK  is  sufficient  to 
remove  most  of  the  traces  of  complicated  dy¬ 
namical  behaviour.  By  comparing  experiments 
with  a  calculation  that  includes  thermal  noise, 
however,  the  existence  of  chaos  may  be  shown 
indirectly. 

4.2.  Experiments  on  real  Josephson  junction 

Common  to  all  the  experimental  results  is  that 
they  are  not  nearly  as  spectacular  as  the  numeri¬ 
cal  simulations.  The  main  reason  is  that  thermal 
noise,  which  is  most  often  not  taken  into  account 
in  simulations,  has  a  major  effect  on  the  out¬ 
come  of  the  experiments.  This  is  because  the 
energy  levels  in  the  thermal  oscillations  may  very 
well  be  of  the  same  order  of  magnitude  as  the 
intrinsic  energy  levels  in  the  Josephson  junc¬ 
tions,  and  complicated  non-linear  interactions 
occur. 
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4.2.1.  dc  observation  of  chaos 

Before  the  term  chaos  was  connected  to 
Josephson  junctions,  researchers  sometime 
noted  very  irregular  and  erratic  IV  curves  in 
samples  subject  to  strong  applied  rf  signals.  In 
many  cases  such  junctions  were  discarded  be¬ 
cause  of  assumed  defects  during  fabrication.  It  is 
known  that  such  irregular  IV  curves  may  be  a 
signature  of  chaos.  Examples  of  such  irregular 
behaviour,  in  particular  the  loss  of  phaselock  on 
an  rf-induced  step,  may  be  found  in  refs.  [22- 
24]. 

By  comparing  such  experimentally  obtained, 
irregular  IV  curves  to  numerically  obtained  ones, 
one  has  in  principle  the  simplest  experiment  on 
chaos.  An  example  of  such  a  dc  experiment  is 
illustrated  in  fig.  6,  which  shows  an  experimen¬ 
tally  obtained  IV  curve  with  loss  of  phaselock  on 
the  rf-induced  step.  Also  shown  in  the  figure  is 
the  spectrum  of  half-harmonic  generation  as 
measured  with  a  sensitive  microwave  spectrome¬ 
ter.  Note  that  these  experimental  curves,  which 
contain  two  period-doubling  bifurcations  and  a 
chaotic  region  on  an  rf-induced  step,  can  be 
considered  as  a  standard  example  of  the  period¬ 
doubling  route  to  chaos.  There  experimental  re- 


Fig.  6.  IV  curve  a  and  half  harmonic  microwave  power. 
Experiments  after  ref.  (24]. 


suits  are  very  similar  to  the  numerical  results 
shown  in  fig.  1  of  ref.  [17]  and  to  analog  results. 

4.2.2.  Chaos  and  parametric  amplification 

For  Josephson  junction  parametric  amplifiers, 
low  noise  temperatures  were  found  in  some 
cases,  however,  more  often  experiments  showed 
considerable  excess  noise  [25].  For  experimental¬ 
ists  the  observed  noise  rise  has  been  a  major 
puzzle.  A  large  number  of  theoretical  and  nu¬ 
merical  papers  [26]  have  dealt  with  the  problem. 
The  conclusion  is  that  the  very  large  noise  tem¬ 
peratures  cannot  be  explained  by  traditional 
noise  sources  such  as  Johnson  noise,  shot  noise, 
or  quantum  noise.  Hubermann  et  al.  [13]  first 
suggested  that  chaos  was  the  origin  of  the  excess 
noise.  This  suggestion  was  further  substantiated 
in  ref.  [14]. 

More  recently  a  slightly  different  type  of 
Josephson  junction  parametric  amplifier  was  in¬ 
vestigated  [27]  by  another  group,  which  drew  the 
conclusion  that  noise  in  this  amplifier  cannot 
arise  from  deterministic  chaos  alone.  The  ob¬ 
served  noise  increase  required  the  presence  of 
thermal  noise.  They  suggested  that  the  noise 
increase  was  due  to  thermally  induced  hopping 
between  a  bias  point  that  would  be  stable  in  the 
absence  of  thermal  noise  and  an  unstable  point. 
This  observation  demonstrates  the  importance  of 
thermal  noise  in  modeling  chaos  in  Josephson 
junction  systems.  A  noise  temperature  of  as 
much  as  10^  K  may  be  obtained. 

5.  Conclusion 

This  paper  has  discussed  mainly  the  non-linear 
properties  of  Josephson  junctions,  which  have  so 
much  promise  for  both  applications  and  con¬ 
tinued  research  on  fundamental  problems.  The 
problems  we  have  dealt  with  have  all  been  de¬ 
fined  on  the  basis  of  the  “old”  superconductors. 
Future  work  involving  the  new  high  supercon¬ 
ductors  will  most  likely  deal  not  only  with  the 
same  type  of  problems  as  discussed  above  for 
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different  parameters,  but  also  with  completely 
new  non-linear  phenomena,  for  example,  due  to 
anisotropy  and  complicated  flux  dynamics.  It 
may  be  safely  predicted  that  a  lot  of  interesting 
non-linear  physics  lies  ahead. 
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Josephson  soliton  oscillators  integrated  in  a  resonator  consisting  of  two  closely  spaced  superconducting  microstrips 
have  been  investigated  experimentally.  The  dc  /K-characteristic  shows  zero-field  steps  containing  a  number  of  resonator- 
induced  substeps,  occurring  because  of  locking  of  the  soliton  oscillator  to  the  resonator.  The  position  of  the  substeps 
follows  the  structure  of  a  “devil's  staircase”,  and  their  appearance  depends  strongly  on  the  externally  applied  dc 
magnetic  field. 


1.  Introduction 

The  study  of  the  dynamical  behaviour  of  long 
Josephson  junctions  is  of  great  fundamental  and 
practical  interest.  Besides  showing  a  rich  varia¬ 
tion  of  nonlinear  properties  and  chaotic  regimes, 
they  have  potential  applications  as  electronic  de¬ 
vices,  such  as  local  oscillators  for  SlS-mixers  in 
the  realization  of  cryogenic  integrated  millime¬ 
ter  wave  receivers  for  use  in  radio-astronomy 
and  space  communication  [  1  ]. 

The  one-dimensional  Josephson  soliton  os¬ 
cillator  is  a  Josephson  tunnel  junction  which 
is  much  longer  than  the  Josephson  penetration 
depth,  Aj,  in  one  direction  and  comparable  to 
or  less  than  Aj  in  the  other  direction.  When  the 
junction  is  properly  dc  biased,  a  localized  quan¬ 
tum  of  magnetic  field  -  a  soliton  -  can  oscillate 
back  and  forth  in  the  junction,  hereby  emit¬ 
ting  radiation  in  the  millimeter  wave  frequency 
range  [2]. 

The  junctions  discussed  here  are  integrated 
in  a  resonant  structure  consisting  of  two  closely 
spaced  coplanar  superconducting  microstrips  as 


Fig.  1.  Schematic  drawing  of  the  sample  geometry.  The 
coplanar  resonator  has  its  electromagnetic  field  conflned 
in  the  gap  between  the  two  closely  spaced  Nb-strips.  The 
two  Josephson  junctions  are  formed  where  the  top  and  bot¬ 
tom  films  of  the  left  strip  overlap.  The  microstrip  antenna 
shown  at  the  top  of  the  figure  is  used  for  the  microwave 
measurements. 

shown  in  fig.  1.  Earlier  experiments  [3]  have 
shown  locking  between  one  soliton  oscillator  and 
the  resonator  manifesting  itself  as  substeps  on 
the  zero-field  steps  in  the  /  ^-characteristic.  Here 
we  report  on  a  more  detailed  study  of  these  sub¬ 
steps  in  order  to  clearify  the  nature  of  the  inter¬ 
action  between  the  soliton  oscillator  and  the  res¬ 
onator. 
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Fig.  2.  Lower  part  of  the  ZFSl  for  a  four-junction  sample 
(U1-4JJ).  T  =  4.2K,  B  =  0. 

2.  Experiments 

The  samples  were  fabricated  in  Nb/AlOx/Nb 
trilayer  technology  with  a  critical  current  density 
of  10^  A/cm^  as  described  elsewhere  [4].  The  di¬ 
mensions  of  the  junctions  are  200  fim  by  10  //m 
and  they  are  closely  spaced  in  pairs  placed  in  the 
middle  of  one  of  the  Nb-strips  (see  fig.  1 ).  The 
critical  current  was  10  mA.  The  Josephson  pen¬ 
etration  depth  is  Aj  ~  10  fim  giving  a  maximum 
plasma  frequency  of  ~  130  GHz.  The  maxi¬ 
mum  voltage  of  the  first  zero-field  step  (ZFSl ) 
is  approximately  1 00  fiV  corresponding  to  a  soli¬ 
ton  oscillator  frequency  of  24  GHz.  The  varia¬ 
tion  in  the  measured  parameters  was  less  than 
5%. 

Fig.  2  shows  the  lower  part  of  ZFS 1  for  one  of 
the  junctions  in  a  four-junction  sample.  A  hys- 
teretic  step  at  79  fi\,  corresponding  to  a  soliton 
frequency  of  1 9  GHz,  is  present,  but  a  series  of 
minor  steps  can  also  be  seen  at  lower  voltages.  All 
the  steps  are  interpreted  as  coupling  between  the 
soliton  oscillator  and  the  external  resonator  [5]. 
By  applying  a  dc  magnetic  field  in  the  plane  of 
the  junction  and  thereby  changing  the  coupling. 


we  noted  that  the  substeps  were  approximately 
one  order  of  magnitude  more  sensitive  to  the 
magnetic  field  than  the  critical  current. 


3.  Discussion 

The  dominant  mode  in  the  present  geometry  is 
the  coplanar  resonator  mode,  where  the  millime¬ 
ter  wave  electric  and  magnetic  fields  are  mainly 
confined  to  the  narrow  space  between  the  strips. 

In  order  to  model  the  geometry  used  in  ref. 
[1],  Gronbech- Jensen  et  al.  developed  a  sim¬ 
ple  analytical  approach  [6],  where  the  overlap 
junctions  are  coupled  to  a  lumped  element  lin¬ 
ear  tank  circuit.  The  model  predicts  locking  be¬ 
tween  the  soliton  and  the  tank  circuit  when  the 
junction  is  operated  near  the  fundamental  reso¬ 
nance  frequency  of  the  tank  or  its  subharmonics 
[7].  This  has  been  confirmed  numerically.  The 
height  of  the  subharmonic  steps  was  seen  to  vary 
approximately  inversely  proportional  to  the  sub¬ 
harmonic  number  [7]. 

In  contrast  to  this  model,  the  distributed  res¬ 
onator  in  our  setup  will  exhibit  resonant  be¬ 
haviour  at  all  multipla  n  of  one  half-wavelength 
giving  rise  to  locking  whenever  the  soliton  fre¬ 
quency  equals  a  fractional  harmonic  n/m  of  the 
half-wavelength  resonance  frequency  Vc  of  the 
coplanar  mode.  Indeed,  the  position  of  the  steps 
in  fig.  2  corresponds  very  well  to  n/m  =  2/5 
and  nim  =  4/9,  assuming  that  the  hysteretic 
step  corresponds  lo  n/m  =  1/2. 

The  height  of  the  substeps  decreased  with  in¬ 
creasing  temperature.  At  a  temperature  wh 
the  substeps  were  completely  suppressed,  the 
junction  was  irradiated  with  microwaves  via 
the  microstrip  antenna  shown  at  the  top  of  fig. 
1,  and  rf-induced  phase-locked  steps  were  ob¬ 
served.  Not  only  locking  to  the  fundamental  fre¬ 
quency  was  seen,  but  also  locking  to  fractional 
harmonics,  following  the  structure  of  a  “devil’s 
staircase”  with  locking  at  n/m  =  {1/3,  2/5, 
3/7,  4/9,  5/11,...,  1/2}  was  observed.  Com¬ 
paring  to  fig.  2,  the  3/7  and  5/11  harmonics 
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Fig.  3.  Curve  showing  the  height  of  the  hysteretic  step  in  a 
two-junction  sample  (U3'2JJ).  The  first  minimum  of  the 
critical  current  vs  dc  magnetic  field  occurs  at  a  much  larger 
field  {B  =  320 //T).  T  =  4.2  K. 

are  missing.  However  by  applying  a  small  exter¬ 
nal  dc  magnetic  field  also  these  steps  actually 
appeared  in  the  IV -curve. 

Another  geometry  with  only  one  pair  of  junc¬ 
tions  showed  the  same  behaviour.  A  large  hys¬ 
teretic  step  occured  at  75  /tV,  together  with  a 
number  of  other  substeps.  By  comparing  the  dif¬ 
ferent  junctions,  it  was  noted  that  the  structure 
scaled  with  the  position  of  the  large  substep.  The 
symmetry  of  the  “devil’s  staircase”  around  1/2 
was  confirmed  by  the  following  sequence  of  ob¬ 
served  substeps:  «/m  =  {1/2,  6/11,  5/9,  4/7, 
3/5}.  For  even  values  of  n,  it  was  necessary  to  ap¬ 
ply  a  small  dc  magnetic  field  to  observe  the  sub¬ 
steps.  The  height  of  the  substep  was  smaller  for 
larger  n  in  agreement  with  theory.  Fig.  3  shows 
the  height  of  the  hysteretic  step  for  this  sample 
as  function  of  applied  magnetic  field. 
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Numerical  simulations  have  been  performed  for  an  annular  Josephson  junction  with  a  normalized  length  equal  to 
6,  using  the  perturbed  sine-Gordon  equation.  The  calculated  current-voltage  characteristics  reveal  a  small  jump  on 
the  steps  3  and  4  in  accordance  with  experimental  observations.  The  computer  simulations  show  that  the  jumps  are 
caused  by  a  transition  from  a  non-bunched  to  a  bunched  fluxon  oscillation  state  as  the  bias  current  is  increased. 


The  annular  Josephson  junction  allows  for 
the  study  of  undisturbed  motion  of  sine-Gordon 
solitons  (magnetic  flux  quanta)  in  the  absence 
of  boundaries  and  soliton-soliton  collisions.  Re¬ 
cently  the  dynamics  of  a  soliton  chain  trapped 
in  an  annular  Josephson  junction  has  been  in¬ 
vestigated  experimentally  [  1  ] .  The  experiment 
showed  the  evidence  for  a  crossover  in  the  soli¬ 
ton  chain  velocity  v  dependence  versus  the  driv¬ 
ing  force  y  (I-V  characteristics),  which  has  been 
suggested  to  be  due  to  soliton  bunching  [2]  at 
high  velocities.  The  surprising  fact  observed  in 
the  experiment  was  that  the  crossover  has  been 
found  only  for  the  configurations  with  n  =  3 
and  4  solitons  trapped  in  the  annular  Josephson 
junction  of  the  normalized  length  /  «  6.  For  the 
cases  of  n  =  2  and  n  >  4  no  specific  features 
have  been  observed  in  the  I-V  curves. 

In  this  contribution  we  report  on  our  recent 
numerical  simulations,  which  where  performed 
in  order  to  check  if  the  behavior  which  was  ob- 
rerved  in  the  experiment  [2]  can  be  explained 
by  the  soliton  bunching  effect. 

The  system  discussed  here  is  described  by  the 
perturbed  sine-Gordon  equation  [3], 

<Pxx  -  Vtt  =  sin  ^!>  +  atp,  -  fi^xxi  +  y,  ( 1 ) 


where  is  the  space  and  time  dependent 

superconducting  phase  difference.  The  spatial 
coordinate  x  is  normalized  to  the  Josephson 
penetration  depth  Xj ,  the  time  t  to  the  inverse 
plasma  frequency  Wq  ' ,  a  and  /?  are  the  dissipa¬ 
tion  coefficients,  and  y  is  the  bias  current.  For 
the  annular  junction,  the  boundary  conditions 
for  eq.  ( 1 )  are  periodic, 

=  ^0(0,0  +  2nn, 

(PxdJ)  =  <Px(0j).  (2) 

As  known  from  previous  studies  [4] ,  the  pres¬ 
ence  of  the  surface-loss  term  in  eq.  ( 1 )  leads  to 
a  spatially  oscillating  trailing  tail  behind  a  mov¬ 
ing  soliton,  which  appear  at  high  soliton  veloc¬ 
ity  Vs.  This  may  give  rise  to  bunched  (bound) 
states  of  solitons  in  the  chain,  thereby  breaking 
the  symmetry  in  the  system. 

For  solving  eq.  ( 1 )  with  the  boundary  con¬ 
ditions  (2)  we  were  using  a  pseudo-spectral 
Fourier  method  [5].  The  parameters  have  been 
chosen  close  to  the  experimental  values  [1]: 
/  =  6.0,  a  =  0.02,  p  =  0.02,  n  =  1,2,...,  6.  In 
order  to  achieve  a  stationary  dynamic  state,  for 
each  value  of  the  bias  current  y  the  integration 
was  running  to  let  the  solitons  make  from  60 
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Fig.  1.  The  calculated  l-V  curves  of  the  annular  Josephson 
nction  with  different  numbers  n  =  1,...,6  of  trapped 
tons. 


dv>/dx  dv>/dx 
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Fig.  2.  The  spatial  distribution  of  the  normalized  magnetic 
field  in  the  junction  on  step  3  at  >'=0.26  (a)  and  at  }'  =  0.34 
(b). 

up  to  1200  revolutions  in  the  system.  After  that 
the  average  voltage  v  =  {<l>t)  was  calculated  using 
both  temporal  and  spatial  averaging. 

The  normalized  I-V  characteristics  iy-v) 
have  been  calculated  for  a  different  number  of 
solitons  n  in  the  system.  The  results  are  shown 
in  fig.  1.  Apparently,  this  collection  of  curves 
looks  quite  similar  to  fig.  2a  of  experiment  [  1  ]. 
In  a  nice  agreement  with  the  experiment,  the 
crossover  manifested  by  small  steps  on  the 
curves  for  n  =  3  and  4  solitons  is  very  pro¬ 
nounced  (compare  with  fig.  2b  of  experiment 
[1]). 

The  spatial  magnetic  field  distributions 
(Px(x,t)  for  step  3  at  y=0.26  and  at  y=0.34 
are  shown  in  fig.  2.  It  is  apparent  that  below 


d^x  d^ydx 


0123456  0123456 
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F^  3.  The  spatial  distribution  of  the  normalized  magnetic 
field  in  the  junction  on  step  4  at  ]'=0.26  (a)  and  at  }'=0.34 
(b). 

the  small  step  in  the  y-v  curve,  the  three  soli¬ 
tons  are  equidistantly  spaced  (the  system  has  a 
translational  symmetry  in  the  x-direction  with 
a  period  of  l/n)  and  accordingly  the  solitons 
are  not  bunched.  However,  above  the  small  step 
we  observe  bunching  of  the  solitons.  A  similar 
crossover  from  non-bunched  to  bunched  soli¬ 
tons  are  detected  for  the  step  4  as  the  bias  cur¬ 
rent  y  is  increased  (see  fig.  3).  These  pictures 
prove  that  the  crossover  we  observe  in  the  exper¬ 
iments  is  due  to  the  bunching  of  solitons  in  the 
junction.  The  bunching  effect  "helps”  the  chain 
of  solitons  to  overcome  the  dissipative  losses. 
Their  average  velocity  becomes  higher  than  that 
of  a  single  soliton  at  the  same  bias  current. 

Discussion.  At  the  crossover  from  the  equidis¬ 
tantly  spaced  solitons  to  the  bunched  soliton  dy¬ 
namic  mode,  a  jump  to  higher  voltages  occurs. 
The  experiments  [  1  ]  show  a  relative  increase  in 
the  voltage  which  is  less  than  1%,  whereas  in  the 
numerical  simulations  this  relative  voltage  jump 
is  about  2%.  This  discrepancy  in  the  numerical 
simulations  may  be  attributed  to  the  influence  of 
the  neglected  spatial  extension  in  the  ^-direction 
of  the  annular  junction  where  the  inner  circum¬ 
ference  is  smaller  than  the  outer  one. 

In  the  numerical  simulations  the  step  hights 
follow  the  pattern  seen  in  the  experiments.  The 
second  step  is  higher  than  the  first  and  the  third 
one.  Similarly,  the  form  of  the  steps  5  and  6  in 
the  numerical  simulations  resembles  very  closely 


40 


M.P.  S0rensen  et  ai.  /  Soliton  bunching  in  annular  Josephson  junction 


the  form  of  the  steps  8  and  9  in  the  experiment 
shown  in  ref.  [  1  ].  The  fact  that  three  more  steps 
are  seen  in  the  experiments  indicates  that  the 
experimental  junction  is  9  times  the  magnetic 
penetration  depth  and  not  6  as  anticipated  in  the 
numerical  simulations. 

We  are  grateful  to  Alwyn  C.  Scott  and  Niels 
Gronbech-Jensen  for  stimulating  discussions. 
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It  is  known  that  the  long  annular  Josephson  junction  allows  to  avoid  the  influence  of  boundaries  and  soliton(fluxon)- 
soliton  collisions  on  the  dynamics  of  soliton  chains.  With  new  experimental  methods  and  junction  design  a  fully  controlable 
way  of  introducing  individual  solitons  into  such  a  system  is  realized.  For  the  homogeneous  junction  crossover  at  high 
velocities  of  the  fluxon  chain  is  seen  and  explained  in  terms  of  the  fluxon  bunching.  In  the  periodically  modulated  junction 
a  new  resonant  mode  called  a  supersoliton  is  observed. 


1.  Introduction 

Josephson  junction  fluxons  represent  a  well- 
known  example  of  nonlinear  electro-magnetic 
waves  [1].  The  system  is  described  by  the  per¬ 
turbed  sine-Gordon  equation  [2].  In  case  of 
conventional  topology  of  a  long  Josephson  junc¬ 
tion,  a  moving  soliton  (magnetic  flux  quantum, 
or  fluxon)  cannot  avoid  collisions  with  other 
solitons  and  the  junction  boundaries.  This 
strongly  complicates  the  analysis  and  interpreta¬ 
tion  of  the  experimental  data. 

This  paper  emphasizes  new  results  obtained 
for  the  ideal  experimental  system  -  the  annular 
Josephson  junction  [3].  In  the  annular  junction 
one  may  avoid  fluxon  collisions  and  study  in  a 
controllable  way  the  motion  of  individual  flux¬ 
ons.  With  new  experimental  methods  and  junc¬ 
tion  design  we  have  realized  for  the  first  time  a 
fully  controllable  way  of  introducing  individual 
solitons  into  such  a  system.  This  has  allowed  us 
to  study  multisoliton  dynamics  without  collisions 
with  boundaries  and  to  make  a  comparison  with 
existing  soliton  chain  perturbation  theory.  Two 
types  of  junctions,  homogeneous  and  spatially 
modulated  rings,  have  been  investigated. 


2.  Experimental  methods  of  fluxon  trapping 

Annular  junctions  with  the  inner  radius  of  the 
ring  100  jim  and  with  the  width  of  the  tunnel 
barrier  ring  20  jjum  were  used.  The  critical  cur¬ 
rent  density  was  about  40A/cm^,  which  corre¬ 
sponds  to  the  characteristic  size  of  a  fluxon  kj  = 
TOp-m.  In  order  to  achieve  a  high  homogeneity 
of  the  tunnel  barrier  it  was  formed  in  the  area 
between  two  insulating  SiO  rings  covering  the 
edges  of  the  bottom  electrode.  On  each  substrate 
together  with  homogeneous  junctions  we  have 
fabricated  several  modulated  annular  UJ.  In 
order  to  provide  a  periodic  modulation  of  the 
critical  current  density  along  the  LJJ,  the  regular 
lattice  of  inhomogeneities  was  made  using  SiO 
strips  placed  across  the  junction. 

We  have  realized  experimentally  two  different 
techniques  for  trapping  the  magnetic  flux  in  the 
junctions.  First,  using  low  temperature  scanning 
electron  microscopy  (LTSEM)  we  were  able  to 
trap  one  by  one  several  magnetic  flux  quanta  in 
the  tunnel  barrier  area  of  the  Josephson  junc¬ 
tion.  Second,  by  applying  the  external  mono¬ 
pole-like  magnetic  field  (MMF)  using  a  special 
coil  we  trapped  the  flux  in  the  junction  at  the 
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Fig.  1.  (a)  The  magnetic  flux  trapping  procedure  in  an 
annular  junction  using  LTSEM.  (b)  The  cross-section  of  the 
junction  along  the  dashed  line  shown  in  (a)  at  the  moment  of 
crossing  by  the  electron  beam  (the  insulating  SiO  window  is 
not  shown). 


temperature  near  the  critical  temperature  of  the 
superconducting  electrodes. 

In  the  LTSEM  method,  the  focused  electron 
beam  was  used  for  trapping  magnetic  flux  quanta 
in  the  annular  junctions.  The  crossing  of  the  top 
electrode  by  the  electron  beam  focus  resulted  in 
trapping  the  magnetic  flux  in  the  tunnel  barrier 
(fig.  la).  This  was  due  to  the  local  heating  of  the 
sample  in  the  focus  of  the  beam.  A  normal 
domain  carrying  the  magnetic  flux  was  moved 
across  the  Pb  film  together  with  the  electron 


beam  focus  as  shown  in  fig.  lb.  The  details  of 
this  technique  are  presented  elsewhere  [4]. 

In  the  MMF  method  we  have  used  a  current- 
carrying  ring  above  the  junction  in  order  to 
produce  a  radial  magnetic  field  component  along 
the  junction  circumference.  The  magnetic  flux 
happens  to  be  trapped  in  the  junction  at  the 
temperature  close  to  the  critical  temperature  of 
the  superconducting  electrodes.  The  details  are 
described  elsewhere  [5]. 


3.  Results  and  discussion 

Fig.  2a  shows  the  experimental  I-V  charac¬ 
teristics  for  the  trapping  of  solitons  in  the  annu¬ 
lar  Josephson  junction  obtained  by  a  sequential 
crossing  of  the  junction  with  the  electron  beam. 
Typically,  each  crossing  resulted  in  trapping  a 
one  flux  quantum  in  the  ring.  The  critical  current 
of  the  junction  without  trapped  flux  was  about 
3.3  mA.  The  resonant  soliton  steps  corre- 


Fig.  2.  (a)  The  I-V  curves  corresponding  to  the  sequential  trapping  of  solitons  in  the  homogeneous  annular  Josephson  junction 
using  the  LTSEM  method.  The  numbers  correspond  to  the  number  of  solitons  trapped,  (b)  The  detailed  comparison  of  the 
high-current  parts  of  the  scaled  soliton  steps  in  a  magnified  voltage  scale  (the  voltage  of  the  nth  step  is  divided  by  n). 
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spending  to  different  numbers  of  unipolar 
solitons  n  are  observed.  The  voltages  of  the  steps 
are  quantized  as  expected,  according  to  the 
formula:  =  h^qCIL,  where  the  magnetic 

flux  quantum,  c  the  maximum  velocity  of  the 
electromagnetic  waves  in  the  junction,  and  L  is 
the  junction  circumference.  At  the  step  voltage 
the  solitons  move  as  relativistic  particles  with 
the  velocity  close  to  c.  If  solitons  are  trapped  in 
an  ideally  homogeneous  annular  junction,  theo¬ 
retically  the  non-dissipative  critical  current  4 
should  be  equal  to  zero.  This  is  perfectly  fulfilled 
for  all  the  curves  in  fig.  2a  and  is  a  good  proof 
for  the  high  homogeneity  of  our  junctions.  The 
multi-soliton  states  for  the  perturbed  sine-Gor- 
don  system  with  periodic  boundary  conditions 
were  analyzed  by  Marcus  and  Imry  [6].  The 
tendency  to  reach  an  asymptotic  linear  slope 
with  the  increasing  number  n  of  solitons  pre¬ 
dicted  from  theory  [6]  is  clearly  seen  in  the 
experiment. 

Fig.  2b  shows  the  current  plotted  versus  the 
voltage  per  fluxon  V^/n  and  displays  the  top 
parts  of  the  steps  in  a  magnified  voltage  scale. 
Since  V„ln  is  proportional  to  the  average  fluxon 
velocity  u,  we  can  compare  the  dependence  of  v 
on  the  external  driving  force  /  for  different 
fluxon  densities.  With  increasing  step  number  of 
crossover  in  the  step  voltage  is  clearly  seen.  At 
certain  fluxon  velocities  the  steps  with  n  =  3  and 
n  =  4  display  a  kind  of  shift  to  higher  voltages, 
which  is  not  expected  from  theory  [6].  As  it  was 
known  already  from  early  computer  simulations 
(A.C.  Scott  and  W.J.  Johnson,  unpublished, 
1968),  the  dissipation  due  to  the  surface  (/3) 
losses  [7,8]  at  high  velocity  generates  an  effective 
attraction  between  unipolar  solitons  and  results 
in  bunching  in  a  localized  soliton  train  moving  in 
the  junction.  This  bunching  effect  breaks  the 
symmetry  in  the  annular  system  and  “helps”  the 
chain  of  solitons  to  overcome  the  dissipative 
losses.  Their  average  velocity  then  becomes 
higher  than  that  of  a  single  soliton  at  the  same 
bias  current.  Recent  numerical  simulations  [9] 
have  shown  a  nice  agreement  with  our  experi¬ 


ment,  also  displaying  the  crossover  at  high  ve¬ 
locity  for  the  cases  of  3  and  4  solitons. 

In  periodically  modulated  rings  we  observe  a 
new  resonant  mode,  called  the  supersoliton 
mode,  which  has  been  found  before  in  numerical 
simulations  [10]  and  described  theoretically  [11]. 
This  mode  is  justified  by  the  additional  voltage 
steps  in  the  current-voltage  characteristics  of  the 
junction,  which  appear  when  the  spatial  periods 
of  solitons  and  the  junction  modulation  are 
incommensurate.  Fig.  3  shows  the  I-V  charac¬ 
teristics  of  the  modulated  annular  junction  with 
N  =  5  inhomogeneities  (the  modulation  period  is 
equal  to  5  of  the  junction  circumference)  and 
/I  =  6  solitons  trapped  in  it  using  the  MMF 
method.  Each  inhomogeneity  represents  a  region 
in  the  junction  where  the  critical  current  density 
is  suppressed,  so  it  attracts  a  soliton.  It  is 
energetically  advantageous  to  place  the  6-th 
soliton  as  a  defect  (supersoliton)  on  already 
pinned  regular  lattice  of  5  other  solitons.  Under 
the  influence  of  bias  current  this  defect  (super¬ 
soliton)  moves  along  the  junction,  while  all  the 
other  solitons  remain  being  pinned.  In  the  I-V 
curve  this  mechanism  leads  to  the  pronounced 
first  step  at  V=«32p.V  in  fig.  3.  The  detailed 
explanation  for  the  features  observed  in  the  I-V 
curves  may  be  found  elsewhere  [5]. 


1 

1 


Fig.  3.  The  l-V  curve  of  the  junction  with  N  =  5  inhomo- 
geneities  and  n  =  6  trapped  solitons  using  the  MMF  method. 
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New  infrared  measurements  of  the  anomalous  amide  modes  in  acetanilide  and  its  derivatives  are  presented.  Preliminary 
results  of  structural  data  obtained  by  neutron  diffraction  at  low  temperature  are  also  described. 

Besides  the  well-known  anomalous  amide-1  mode  (1650 cm  '),  it  is  shown  that  the  NH  out-of-plane  bend  (770cm  ') 
and  the  "H-bond  strain"  (at  about  105  cm  ')  exhibit  an  anomalous  increase  of  intensity  proportional  to  the  law 
expf-T'/®’),  suggesting  that  the  amide  proton  bears  a  significant  electronic  distribution  as  formerly  observed  for  H 
localized  modes.  Structural  data,  moreover,  show  that  the  thermal  ellips  of  the  amide  proton  has  an  increasing  anisotropy 
at  15  K. 

Considering  these  new  results,  the  theoretical  model  of  a  self-trapped  “polaronic"  state  seems  to  be  the  most  consistent 
with  the  whole  set  of  observed  anomalies  in  this  family  of  crystals. 


1.  Introduction 

The  orthorhombic  acetanilide  crystal 
(C^H,NHCOCH,  or  ACN)  and  its  deuterated 
derivatives  contain  one-dimensional  networks  of 
hydrogen-bonded  .  .  .  H-N-C-O  •  ■  •  amide 
groups,  like  polypeptides  and  alpha-helix  pro¬ 
teins  [1].  The  family  of  ACN  compounds  display 
some  anomalous  infrared  and  Raman  modes  [2], 
the  origin  of  which  is  still  a  subject  of  consider¬ 
able  controversy.  Tentative  explanations  of  these 
extra-intensities  observed  at  low  temperature 
have  been  given  in  terms  of  Davydov-like  soliton 
theory  [3],  vibron  soliton  [4],  “polaronic”  local¬ 
ized  mode  [5,6],  nonlinearly  coupled  oscillators 
[7,8],  Alternative  interpretations  involve  Fermi 
resonance  [9]  and  more  recently  a  model  based 
on  two  slightly  nondegenerate  configurations  of 
the  amide  proton  [10], 

In  this  paper  we  present  recent  data  focused 
on  the  unconventional  amide  modes  and  propose 
a  new  comprehensive  view  of  the  anomalies  in 
ACN  family  in  terms  of  a  hydrogen  localized 
mode  similar  to  the  U  centers  formerly  observed 
in  ionic  crystals  [11], 


2.  New  infrared  results 

2.1.  The  amide  A,  B,  region  [2,12] 

Absorption  measurements  of  selectively  deu¬ 
terated  ACN  and  of  the  methyl  fluorinated  ACN 
(abbreviation  3F)  [13]  allow  to  estimate  the  rela¬ 
tive  strength  W  of  the  hydrogen-bond  in  each 
derivative. 

i/jF  =  3303 cm”'  ,  =  3281  cm”'  , 

j/ps  =  3275  cm  '  ,  i'ds  ~  3273  cm” '  , 
t'D8  =  3272cm”' ,  =  2414cm”' , 

it  follows  that  [14] 

<  M^ACN  <  VVd5  <  <  1^09  ■ 

(ND  means  CjHjNDCOCHj;  D3  means 
CftH.NHCOCDj;  D5  means  C^DjNHCOCHj; 
and  so  on.) 

The  energy  shift  of  i'(NH)  in  ACN  as  a  func¬ 
tion  of  the  temperature  exhibits  a  characteristic 
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sigmoid  shape,  representative  of  the  existence  of 
competing  interactions,  while  i'(ND)  obeys  a 
normal  monotonous  temperature  dependence. 

2.2.  The  amide- 1  region 

Our  new  data  are  summarized  in  fig.  1.  The 
amide-1  mode  exhibits  a  classical  thermal  be¬ 
haviour  in  D3  and  D8:  the  frequency  decreases 
with  decreasing  temperature  and  no  extra-peak 
is  observed.  On  the  contrary  the  anomalous 
band  is  present  in  D5  and  in  3F,  at  1647.5 cm“‘ 
and  1693cm''‘,  at  20  K,  respectively.  The  energy 
shift  between  the  fully  delocalized  exciton  (stan¬ 


dard  amide-1  frequency)  and  the  supposed  self- 
trapped  state  or  “soliton”  (anomalous  mode  fre¬ 
quency)  is  Af  =  16.5 cm“‘  in  pure  ACN,  A£  = 
15  cm'*  in  D5,  and  A£  =  11  cm'*  in  3F. 

2.3.  The  amide-V  modes 

The  anomalous  thermal  behaviour  of  the  in¬ 
tensity  of  this  mode  (or  y(NH))  [15-18]  consists 
of  an  increase  of  the  integrated  intensity  of  about 
20%  or  more,  when  the  temperature  decreases 
from  300  K  to  20  K.  Fig.  2  shows  the  anomalous 
behavior  in  ACN,  at  772  cm”*  at  about  20  K. 


1650  1700 

WAVENUMBERS  cnr’ 


WAVENUMBERS  cm' 


Fig.  1.  Amide-I  modes  in  four  derivatives  of  ACN;  the  D5  and  3F  compounds  exhibit  an  anomalous  mode  like  pure  ACN,  with  a 
very  small  energy  shift  in  the  range  300-20  K.  On  the  contrary,  the  D3  and  D8  compounds  display  a  classical  behaviour  without 
anomaly. 
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Fig.  2.  Amide-V  modes  (or  NH  out-of-plane  bend)  in  ACN 
and  D5,  showing  an  increase  of  the  intensity  at  20  K.  The 
structure  at  about  760  cm "  ‘  in  the  ND  compound  is  assigned 
to  the  C-H  out-of-plane  bends  in  the  aromatic  ring. 

The  Other  modes  around  760cm~‘  are  assigned 
to  the  C-H  out-of  plane  bends  in  the  phenyl 
ring.  Their  intensity  is  quasi-independent  of  the 
temperature.  The  anomaly  is  also  observed  in 
D3,  D5,  D8,  and  3F  compounds. 

The  temperature  dependence  of  the  anomal¬ 
ous  intensity  [17]  is  compatible  with  the  predic¬ 
tions  of  the  “polaronic”  model  [5,6],  or  of  the 
“vibron  soliton”  model  [4], 

2.4.  The  anomalous  modes  at  about  1500  cm  ' 

They  were  first  identified  [17,14]  in  D8  at 
about  1490 cm'*.  Fig.  3  indicates  that  the  same 
anomaly  exists  in  ACN  as  a  strong  increase 
(more  than  30%)  of  the  integrated  intensity  of 


Fig.  3.  Anomalous  increases  of  intensities  at  20  K  in  ACN,  in 
D5  and  in  3F. 

the  doublet  (1492  and  1504  cm~‘).  Same  strong 
anomalies  are  observed  in  D3,  D5,  and  3F,  at 
1498,  1492cm“',  and  1400-1411  cm" ‘  respective¬ 
ly.  No  anomaly  is  observed  in  this  frequency 
range  on  the  ND  compound. 

These  anomalous  modes  are  tentatively  as¬ 
signed  to  the  overtone  of  the  anomalous  -/(NH) 


48 


M.  Barthes  el  al.  /  H-locahzed  mode  in  chains  of  hydrogen-bonded  amide  groups 


mode.  They  are  also  accidentally  degenerated 
with  the  C-C  stretch  of  the  phenyl  ring,  except 
in  3F.  The  C-C  stretching  modes  are  observed  in 
the  ND  compound,  with  a  classical  temperature 
behaviour. 

An  increase  of  intensity  is  also  observed  at 
about  1430  cm”'  in  ACN  and  D5  only,  for  a 
mode  assigned  to  a  methyl  motion. 

2.5.  The  amide-II  and  amide-ill  modes 

The  6(NH)  or  in-plane  bend  of  the  amide 
proton,  at  about  1560 cm”',  and  the  N-C 
stretching  modes  (presumably  at  1260  and 
1320 cm”')  do  not  show  any  anomalous  be¬ 
havior. 

2.6.  Far-infrared  modes 

Far  infrared  spectra  are  dominated  by  the 
features  at  100-1 10  cm  ”'  in  ACN.  This  mode 
displays  a  strong  increase  of  intensity:  about 
50%  in  the  whole  temperature  range  (fig.  4). 
The  temperature  dependence  of  its  integrated 
intensity  is  compatible  with  a  law«exp(-r^/ 
0^),  its  width  is  consistent  with  a  power  law 
description.  All  these  three  specific  properties 
are  strikingly  resembling  that  of  the  fundamental 
vibrational  mode  (or  zero-phonon  mode)  of  the 
H”  ion  localized  oscillator,  extensively  investi¬ 
gated  in  alkali  halides  [11].  Furthermore,  the 
energy  of  this  mode  is  related  to  the  elastic 
constants  of  ACN  in  the  same  way  as  the  H” 
localized  mode  in  alkali  halides  and  may  be 
scaled  on  the  same  plot  (cf.  fig.  7-2,  in  ref.  [11]). 

Its  frequency  shift  with  successive  deuterations 
is  not  related  with  the  molecular  mass,  but  with 
the  H-bond  strength  (for  example  the  maximum 
is  found  at  107  cm”'  in  ACN,  107cni”‘  in  D5, 
109cm”'  in  D3  at  about  20 K).  Thus  it  is  not  a 
simple  classical  external  mode,  but  it  'Bvolves 
the  H-bond.  It  is  worth  to  remark  that  in  D3  and 
D8  the  mode  appears  to  be  split  and  is  acciden¬ 
tally  degenerated  with  the  deuterated  methyl 
(CD3)  torsion  [15], 

The  motion  associated  to  this  major  intensity 
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Fig.  4.  Evolution  of  the  intensity  of  the  “H-bond  strain"  in 
ACN  with  respect  to  the  temperature. 


at  100-1 10 cm”'  may  be  tentatively  assigned  to 
the  “H-bond  strain”  by  analogy  with  the  same 
features  in  other  amide  compounds  [19].  It  con¬ 
sists  on  a  distorsion  normal  to  the  (b)  H-bond 
direction,  or  a  hindered  rotation  of  the  whole 
molecule  around  the  a  axis  involving  a  significant 
change  of  the  electric  dipole  moment,  so  giving 
rise  to  a  strong  infrared  intensity.  Much  atten¬ 
tion  has  been  given  to  the  H-bond  strain  mode 
because  it  is  suspected  to  be  the  driving  mecha¬ 
nism  for  the  opening  of  the  double  helix  in 
transcription  and  replication  processes  in  DNA 
[19]. 

3.  Low  temperature  neutron  diffraction 

The  low  temperature  crystallographic  structure 
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Fig.  5.  ORTEP  drawings  of  the  amide  group  in  the  ACN 
molecule  at  15  K,  in  the  (a,  b)  molecular  plane,  and  perpen¬ 
dicular.  Note  the  large  anisotropy  of  the  //(I)  ellipsoid  in  the 
c  direction. 


of  ACN  has  been  measured  by  neutron  diffrac¬ 
tion  with  the  aim  of  determining  the  amide 
proton  localisation.  Preliminary  results  indicate 
that  the  thermal  ellipsoid  of  the  proton  is  elon¬ 
gated  in  the  c  direction,  that  is  perpendicular  to 
the  molecular  plane.  The  axis  ratios  of  the  ellip¬ 
soid  are: 

c  c 

-  =  1.05  and  -  =  1.69  at  room  temperature, 

-  =  1.54  and  7  =  2.05  at  15  K. 

a  b 

reflecting  an  increase  of  the  localization  of  the 
proton  along  the  c  axis  as  the  temperature  de¬ 
creases  (fig.  5).  However,  the  difference  Fourier 
maps  show  only  one  maximum  density  region, 
and  not  two  as  could  be  expected  in  the  model  of 
slightly  non-degenerate  configurations  of  the 
proton. 


4.  Discussion 

The  hypothesis  of  a  negative  charge  distribu¬ 
tion  borne  by  the  amide  hydrogen  atom  was 
formerly  suggested  to  explain  some  special  NMR 
chemical  shifts  in  H-bonded  systems  [20].  The 
existence  of  some  positive  charge  density  local¬ 
ized  on  the  nitrogen  atom  was  also  conjectured 


by  Pauling  [21].  Moreover,  the  H"  ion  existence 
as  an  impurity  of  a  color  center  in  alkali  halides 
is  well  established. 

The  H-bond  is  the  result  of  a  delicate  balance 
between  electrostatic  attractive  interactions,  di¬ 
polar  forces,  and  repulsion  from  electronic  orbi¬ 
tals  of  neighbouring  ions.  In  a  classical  system 
like  ND,  the  attractive  interactions  dominate  and 
the  bond  strength  monotonically  increases  as  the 
temperature  decreases. 

In  ACN  on  the  contrary,  competition  occurs 
and  explains  the  S-shaped  curve  of  the  energy 
shift  of  the  r(NH)  mode. 

Assuming  that  some  negative  charge  density  is 
bound  to  the  proton,  it  is  possible  to  explain  that 
repulsive  interactions  increases  as  the  O. .  .H 
distance  decreases  with  decreasing  temperature, 
thus  explaining  the  strong  localization  of  the 
hydrogen  ion  in  the  c  direction,  perpendicular  to 
the  bond.  It  is  now  easy  to  understand  why  both 
the  H-bond  strain  mode  and  the  y(NH)  mode 
increases  in  intensity  and  energy  with  decreasing 
temperature. 

The  anomalous  peak  at  1650  cm"'  may  be  a 
self-trapped  state  resulting  from  a  nonlinear  cou¬ 
pling  of  the  C  =  O  stretch  with  transverse  mo¬ 
tions  like  (in-phase  and  out-of-phase)  librations 
around  the  a  axis,  instead  of  the  longitudinal 
modes  considered  in  the  past.  Nevertheless,  the 
experimental  values  of  A  E  are  not  in  contradic¬ 
tion  with  the  well  known  prediction  [2]: 

AE  =  x^/2W-2J  . 

Following  our  hypothesis,  the  disappearance 
of  the  anomalous  mode  in  the  methyl  deuterated 
compounds  (D3  and  D8)  is  no  more  mysterious. 
Remembering  the  accidental  degeneracy  of  the 
deuterated  methyl  libration  with  the  “H-bond 
strain”  mode,  it  is  clear  that  the  former  coun¬ 
teracts  with  the  latter,  so  probably  restraining  its 
amplitude  and  prevents  its  coupling  with  the 
C  =  O  stretch.  Fermi  resonance  mechanisms  in¬ 
volving  either  the  CHj  “torsional  mode”,  or  the 
“H-bond  strain”  may  be  ruled  out  because  the 
anomalous  amide- 1  mode  still  exists  in  3F  at 
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about  1693 cm"',  in  contradiction  with  such  ex¬ 
planation. 

As  a  conclusion,  we  have  shown  that  a  very 
simple  hypothesis  of  a  center  in  the  amide 
group  suffices  to  explain  the  whole  set  of 
anomalous  features  in  acetanilide,  which,  sub¬ 
mitted  to  competing  opposite  interactions,  is  a 
subtle  frustrated  system.  On  the  other  hand  we 
hope  that  the  present  data  give  useful  informa¬ 
tions  on  the  general  knowledge  of  the  hydrogen- 
bond,  involved  in  most  biological  processes. 
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The  nonresonant  w-electronic  component  of  y{-3ai;  ta,  a>,  o>),  y,,  is  calculated  and  analyzed  for  two  important  types  of 
linear  conjugated  rr-electron  molecules:  linear  cyanine  cations  and  linear  polyenes.  The  specific  linear  cyanine  cations  are 
all- rram,  linear  symmetric  cyanine  and  streptocyanine  cations  with  no  double-single  bond-length  alternation,  and  cyanine 
cations  with  asymmetric  geometries  resulting  from  the  artificial  imposition  of  double-single  bond-length  alternation.  The 
linear  polyenes  of  interest  are  A\-trans  linear  polyenes  with  double-single  bond-length  alternation,  and  those  with 
geometries  resulting  from  the  artificial  imposition  of  no  double-single  bond-length  alternation.  Bond-length  alternation  is 
found  to  dramatically  affect  y,  for  the  linear  cyanines:  the  -y^’s  for  the  symmetric  cyanines  are  calculated  to  be  negative;  the 
■y,’s  for  the  assymmetric  cyanines  change  from  negative  to  positive  with  increasing  chain  length.  The  -y.’s  for  the  linear 
polyenes  are  always  positive  regardless  of  the  extent  of  bond-length  alternation;  the  |y,|'s  for  the  linear  polyenes  increase 
with  decreasing  bond-length  alternation.  The  ly.l’s  for  the  symmetric  linear  cyanines  increase  more  rapidly  with  the 
number  of  w-electrons  than  the  ly^l’s  for  the  linear  polyenes:  ly, (symmetric  cyanines)| « and  ly,(linear 
polyenes)!  “  ^ •  where  =  4,  6,  8,  10,  12.  The  linear  cyanine  cations  comprise  a  very  promising  class  of  nonlinear 

optical,  rr-electron  molecules  that  merit  further  experimental  study. 


1.  Introduction 

Nonlinear  optical  phenomena  are  increasingly 
being  utilized  in  a  variety  of  photonic  applica¬ 
tions.  The  second-order  nonlinear  optical  pro¬ 
cess  of  second-harmonic  generation  is  used  to 
extend  the  frequency  ranges  of  lasers  [1-3]. 
Electro-optic  modulators  and  switches  (e.g.,  Poc- 
kel’s  cell)  operate  via  the  second-order  nonlinear 
optical  effect  of  an  electric-field-induced  change 
in  refractive  index  [3,4].  The  third-order  non¬ 
linear  optical  phenomenon  of  a  light-induced 
change  in  refractive  index  is  fundamental  to 
soliton  light-pulse  propagation  in  optical  media 
[5],  optical  switching  and  computing  [6,7],  as 
well  as  phase  conjugate  adaptive  optics  [8].  The 
efficiencies  of  these  nonlinear  optical  processes 
are  dependent  upon  the  material  employed  to 
couple  the  given  combination  of  electrical  and/ 
or  optical  signals. 

Organic  molecules  and  polymers  with  large 


nonlinear  electronic  polarizabilities  are  of  great 
interest  world-wide  as  nonlinear  optical  (NLO) 
materials  [9].  The  organic  molecules  with  the 
largest  nonlinear  electronic  polarizabilities  tend 
to  fall  in  one  of  three  classes  of  molecules  with 
conjugated  Tr-electron  bonding  networks: 
(1)  polyenes  and  related  compounds  with  pro¬ 
nounced  TT-electron  bond-order  alternation, 
e.g.,  the  asymmetric  merocyanines,  H2N- 
CH=(CH-CH)„=0,  (2)  symmetric  polymethines 
with  attenuated  ir-electron  bond-order  alterna¬ 
tion,  e.g.,  the  symmetric  cyanine  cations, 
H2Ntt7CHttt(CHtttCH);— NHj,  and  (3)  por¬ 
phyrins  and  phthalocyanines.  The  conjugated  Tr- 
electron  bonding  networks  in  these  molecules 
not  only  endow  them  with  large  nonlinear  tt- 
electronic  polarizabilities,  but  with  large  linear 
TT-electron  polarizabilities,  and  high  one-photon 
absorptivities  in  the  visible  and  near-IR  spectral 
regions  [9]. 

The  attenuated  Tr-bond-order  alternation  for 
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the  symmetric  polymethines  results  from  a  tt- 
eiectron  ground  state  that  can  be  described  as  a 
resonance  hybrid  of  two  equivalent  structures, 
eg.,  ^H2N=CH-(CH=CH)„NH2«^H2N-CH= 
(CH-CH)„=NH2  for  the  symmetric  cyanine 
cations.  The  pronounced  bond-order  alternation 
in  the  polyene-like  molecules  results  from  a 
TT-electron  ground  state  that  is  best  described 
by  a  single  structure,  e.g.,  the  ^H2N=CH- 
(CH=CH)„-0^  structure  for  the  merocyanines 
in  a  vacuum  is  not  equivalent  to  the  more  stable 
H2N-CH=(CH-CH)„=0  structure. 

In  free-electron  theory,  the  potential  energy  of 
a  TT-electron  in  a  symmetric  polymethine  can  be 
approximated  by  a  constant  value  because  of  the 
TT-electron  density  delocalization  associated  with 
the  resonance  hybrid;  the  potential  energy  of  a 
TT-electron  in  a  polyene-like  molecule  can  be 
approximated  by  a  periodic  function  with  the 
troughs  and  crests  of  the  function  coinciding  with 
the  double  and  single  bonds  [10].  An  important 
consequence  of  these  different  potential  energy 
functions  is  that  the  energy  difference  between 
the  highest  occupied  molecular  orbital  (HOMO) 
and  the  lowest  unoccupied  molecular  orbital 
(LUMO)  goes  to  zero  with  increasing  chain 
length  for  symmetric  polymethines,  but  con¬ 
verges  to  a  finite  value  with  increasing  length  for 
polyenes.  In  other  words,  symmetric  cyanines 
become  metallic  in  the  limit  of  infinite  length, 
while  the  polyenes  resemble  semiconductors  in 
this  limit  [11,12]. 

This  free-electron /Hueckel  theoretical  picture 
is  in  agreement  with  experimental  and  theoret¬ 
ical  studies  of  the  linear  optical  properties  of 
symmetric  polymethines  and  polyene-like  mole¬ 
cules.  For  example,  there  is  the  experimental 
observation  that  for  the  first  one-photon 
allowed  ’tttt^^-Sj,  transition  increases  linearly 
with  chain  length  for  symmetric  all-trans,  poly¬ 
methine  cations  with  two  terminal  dimethyl 
amino  groups  [(CH3)2NttK:!H— (CHtttCH),!^— 
N(CH3)2  or  streptocyanines]  [13],  but  reaches  a 
limiting  value  in  the  case  of  a\l-trans-a,  co- 
dialkyl-polyenes  [14],  and  the  dimethylamino- 


merocyanines  [13],  (CH3)2N-CH  =  (CH- 
CH)„  =  O.  Also,  the  linear,  static  electric-dipole 
polarizabilities  of  symmetric  cyanine  cations 
have  been  measured  [IS]  and  calculated 
[12,16,17]  to  be  larger  than  those  for  all-tram- 
linear  polyenes  of  comparable  methylation, 
length,  and  number  of  Tr-electrons. 

One  would  also  expect  the  nonlinear  optical 
properties  of  symmetric  polymethines  to  be  dif¬ 
ferent  from  those  of  polyene-like  molecules. 
Third-harmonic  generation  (THG)  measure¬ 
ments  of  the  nonresonant  electronic  component 
of  the  third-order,  electric-dipole  polarizability 
[y,{-3ti>;  a»,  it),  (o),  with  tt>  =  1.91  jim]  for  a  vari¬ 
ety  of  symmetric  cyanine  dyes  in  the  condensed 
phase  yielded  y,(-3w;  to,  to,  ti))’s  with  both  nega¬ 
tive  and  positive  signs,  while  the  measurements 
for  a  selection  of  polyenes  yielded  only  positive 
y,(-3a>;  a>,  w,  a))’s  [15];  the  magnitudes  of  the 
y,(-3ct>;  (1),  o),  ti>)’s  measured  for  streptocyanines 
met  or  exceeded  those  for  a,  a,  a>,  w-tetra- 
methyl-polyenes  of  comparable  size  and  number 
of  TT-electrons  [15].  The  THG  measurements  at 
(o  =  1.85-2.15  M,m  for  the  polycrystalline  powder 
of  a  symmetric  cyanine  dye  with  terminal 
quinoline  rings  determined  |y,(-3w;  w,  to,  £«>)|  to 
be  greater  than  that  of  the  well-studied  poly¬ 
diacetylene  system,  poly-(2, 4-hexadiyn-l,  6-diol- 
bis-(p-toluene  sulfonate)),  or  PDA-PTS  [18].  A 
recent  study  of  the  third-order  nonlinear  optical 
properties  for  a  set  of  zwitterionic  polymethine 
dyes,  the  squaryliums,  emphasizes  the  impor¬ 
tance  of  the  polymethines  as  a  class  of  nonlinear 
optical  molecules  distinct  from  the  polyenes  [19]. 

The  principal  subject  of  this  paper  is  the  calcu¬ 
lation  and  analysis  of  the  nonresonant,  Tr-elec- 
tronic  components  of  third-order,  electric  dipole 
polarizabilities  for  a  set  of  linear  streptocyanines 
and  unmethylated  streptocyanines  (hereafter 
known  simply  as  cyanines).  In  contrast  to  earlier 
theoretical  studies  of  these  polarizabilities  for 
linear  cyanines  [16,17],  we  (1)  use  a  sum-over- 
states  expression  derived  from  time-dependent 
perturbation  theory  [20]  to  calculate  the  tt -elec¬ 
tronic  component  of  the  dynamic  third-order 
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polarizability  for  third-harmonic  generation, 
as  well  as  the  static  third- 
order  polarizability,  'y„(0;  0, 0, 0);  and  (2)  treat 
the  IT-electronic  states  using  the  all-valence-elec- 
tron,  semiempirical  INDO  (intermediate  neglect 
of  differential  overlap)  molecular  orbital  proce¬ 
dure  combined  with  single-  and  double-excita¬ 
tion  configuration  interaction  (SDCI)  of  the  sing¬ 
let  TT-electron  configurations,  instead  of  using 
original  [16]  or  refined  [17]  versions  of  Kuhn’s 
free-electron  method  [10].  Although  our  theoret¬ 
ical  method  is  more  complicated  than  those 
based  on  free-electron  theory,  it  has  been  shown 
to  provide  a  consistent  treatment  of  the  Tr-elec- 
tronic  components  of  the  third-order  polariz¬ 
abilities  and  the  linear  and  nonlinear  spectro¬ 
scopic  properties  of  the  'vir*  states  of  linear 
polyenes  and  benzene  [21].  Critical  to  our  con¬ 
sistent  calculation  of  these  properties  is  the  treat¬ 
ment  of  TT-electron  correlation  at  least  at  the 
level  of  SDCI  [21]. 

We  calculated  y„’s  for  aU-trans,  linear  sym¬ 
metric  cyanine  and  streptocyanine  cations  with 
no  double-single  bond-length  alternation,  and 
for  cyanine  cations  with  asymmetric  geometries 
resulting  from  the  artificial  imposition  of  double¬ 
single  bond-length  alternation.  These  y„’s  are 
compared  with  those  previously  calculated  for 
aW-trans  linear  polyenes  with  double-single  bond- 
length  alternation  [21],  and  for  a  new  set  calcu¬ 
lated  with  geometries  resulting  from  the  artificial 
imposition  of  no  double-single  bond-length  alter¬ 
nation.  The  structures  of  the  five  sets  of  mole¬ 
cules  are  shown  in  fig.  1. 

Our  study  of  the  Tr-electron,  third-order 
polarizabilities  for  the  molecules  shown  in  fig.  1 
consists  of  two  parts:  (1)  a  comparison  of  the 
calculated  and  measured  7(-3w;  w,  w,  <«>)’s  for 
the  linear  streptocyanines  to  extend  the  valida¬ 
tion  of  our  theoretical  method  beyond  that  of  the 
linear  polyenes  and  benzene  [21],  and  (2)  a 
comparison  of  the  calculated  y(-0;  0, 0, 0)’s  that 
emphasizes  analyzing  the  effect  of  terminal 
methyl  group  substitution  on  the  y„’s  for  the 
symmetric  cyanine  cations,  the  effect  of  bond- 


ALL-  mMS  LINEAR  CYANINES 
(n-O.  1.2. 3, 4) 

NO  DOUBLE-SINGLE  DOUBLE-SMGLE 

BONO^^NGTH  ALTERNATION  BOND-LENGTH  ALTERNATION 


ALL-  TRANS  LINEAR  POLYENES 
(n-O.  1.2.  3) 

NO  DOUBLE-SINGLE  DOUBLE-SINGLE 

BOND-LENGTH  ALTERNATION  BOND-LENGTH  ALTERNATION 


Fig.  1.  Structures  of  the  linear  cyanines  and  polyenes  with 
and  without  bond-length  alternation. 


length  alternation  on  the  signs  and  magnitudes  of 
the  y„’s  for  the  linear  cyanines  and  polyenes, 
and  the  dependence  of  |y„|  on  the  number  of 
ir-electrons  in  both  sets  of  molecules.  A  prelimi¬ 
nary  account  of  this  study  is  given  in  ref.  [22]. 

2.  Theoretical  method 

This  section  consists  of  three  parts.  The  first 
part  establishes  the  general  relationship  between 
the  macroscopic  third-order  electric  susceptibility 
and  the  microscopic,  molecular  third-order 
polarizability.  One  of  the  most  important  third- 
order  electric  susceptibilities  from  a  technologi¬ 
cal  point  of  view  is  the  light-induced  change  in 
refractive  index.  This  nonlinear  refractive  index 
is  related  to  the  molecular  third-order  polar¬ 
izability  in  the  second  part  of  the  section.  The 
third  and  final  part  describes  the  quantum  chemi¬ 
cal  calculation  of  the  nonresonant  rr-electronic 
component  of  the  third-order  molecular  polar¬ 
izability. 

2.1.  Relation  between  macroscopic  susceptibility 
and  molecular  polarizability 

The  mathematical  expression  for  the  polariza¬ 
tion  of  a  non-conducting,  non-magnetic  medium 
by  an  applied  external  electric  field(s)  is  given  by 
the  Taylor  series  expansion  [1,3] 
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'*'57  X ABC^ai^i)  ^ci^) 

B  C 

'*'57  2  2  S  A'  ABCD^ai^l  ) 

X  +  •  •  •  ,  (1) 

where  is  the  component  of  the  polarization 
vector  along  the  A  Cartesian  coordinate  axis  in 
the  laboratory  reference  frame;  is  the  angu¬ 
lar  frequency  of  the  polarization  field,  which 
contains  contributions  from  the  linear  and  non¬ 
linear  polarization  of  the  material,  e.g.,  <(>,,<(>,-(- 
o>2,  and  a>i  +  0)2  +  wj;  x^ll,  xTbc^  and  XaIcd  are 
the  real  components  of  the  electric  susceptibility 
tensors  of  rank  2,  3,  and  4,  respectively;  and 
Eai^^k)  is  the  a  =  A,  B,  or  C  component  of  the 
applied  electric  field  vector  oscillating  at  angular 
frequency  <u*.  All  terms  in  eq.  (1)  are  in  esu 
units.  The  1/2!  and  1/3!  factors  in  eq.  (1)  are 
often  incorporated  into  the  definition  of  Xabc^ 
and  x^ABCD-  Note  that  x^abc  's  zero  for  cen- 
trosymmetric  media. 

Most  organic  nonlinear  optical  materials  (e.g., 
organic  molecular  crystals)  are  composed  of  mo¬ 
lecular  units  whose  polarizabilities  sum  to  define 
the  macroscopic  electric  susceptibilities.  The 
electric  susceptibility  tensors  for  these  media  can 
then  be  expressed  as  a  function  of  the  number 
density  of  molecular  units  [/V],  a  term  containing 
the  local  electric  field  tensor  and  the  direction 
cosines  that  relate  the  molecule-fixed  coordinate 
axes  to  the  laboratory  coordinate  axes  [L„(0)], 
and  the  molecular  polarizability  tensor  of  rank 
«  +  l  [a'"’]: 

A''"'=/(/V,a‘'’U„(0))  (n  =  l,2,3).  (2) 

The  molecular  polarizability  tensors,  a*"*  («  = 
1,2,3),  are  more  commonly  represented  as  a 
(molecular  polarizability),  j8  (second-order  mo¬ 
lecular  polarizability),  and  y  (third-order  molec¬ 
ular  polarizability),  respectively.  For  weakly- 


coupled  molecules  randomly  oriented  in  a  given 
medium,  can  be  expressed  simply  as  [23,24] 

A'‘"'  =  A/<a*'’*)L„  (/i  =  l,2,3),  (3) 

where  L„  is  a  scalar  local-field  factor  defined  by 
the  refractive  indices  of  the  medium  at  the  fre¬ 
quencies  of  the  applied  electric  fields  and  the 
polarization  field,  and  (a*"*)  is  the  orientation- 
ally-averaged  molecular  polarizability  tensor. 
The  product,  (a*'*)Lp  defines  the  averaged 
first-order  dipole  moment  induced  at  the  given 
molecular  unit  by  an  applied  electric  field.  In  a 
similar  fashion,  the  products,  (a‘^*)L2  and 
define  the  averaged  second-  and  third- 
order  dipole  moments  induced  at  this  unit. 

2.2.  Light-induced  change  in  refractive  index 

The  third-order  nonlinear  optical  pb*-  !  lomenon 
of  a  light-induced  change  in  refractive  index  is 
fundamental  to  soliton  light-pulse  propagation  in 
optical  fibers  [5],  optical  switching  and  comput¬ 
ing  [6,7],  as  well  as  phase  conjugate  adaptive 
optics  [8].  The  dependence  of  the  real  refractive 
index  at  wavelength  A  on  the  incident  light  inten¬ 
sity  can  be  expressed  as 

n(  A,  I)  =  n„{  A)  +  An{\,I),  (4) 

where  /lo(^)  the  linear  refractive  index  at  A, 
and  A/j(A,  /)  is  the  change  in  refractive  index  as 
a  function  of  A  and  the  incident  light  intensity  I. 
The  leading  terms  in  the  series  expansion  for 
Art(A,  I)  are  n2iX)  I  and  n,h(A)  I,  so  that 

"(A,  I)  =  no{\)  -I-  [rt2(A)  +  n,h(A)]/  .  (5) 

The  nonlinear  refractive  index  coefficient, 
/12(A),  defines  the  change  in  refractive  index 
arising  from  the  electric-field  induced  polariza¬ 
tion  of  the  material,  while  the  /i,h(A)  coefficient 
describes  the  change  in  the  real  part  of  the 
refractive  index  resulting  from  the  light-induced 
heating  of  the  material,  i.e.,  the  thermo-optic 


B.  M.  Pierce  /  Linear  conjugated  it -electron  molecules 


55 


effect  [61.  The  thermo-optic  effect  is  to  be  avoid¬ 
ed  in  all  device  applications. 

The  n2{\)  coefficient  is  related  to  the  third- 
order  nonlinear  electric  susceptibility  tensor  ele¬ 
ment,  A'!48co(^)'  t’y  the  following  equation  [25]: 

«2(A)  =  5iT'Ar!4'«cD(A)/(cno(A)-) ,  (6) 

where  c  is  the  speed  of  light.  The  units  of  all 
parameters  in  eq.  (6)  are  in  esu.  The  expression 
for  n2(^)  eq.  (6)  is  appropriate  only  for  the 
Taylor  series  expansion  given  in  eq.  (1).  This 
third-order  nonlinear  optical  response  is  also 
known  as  the  optical  Kerr  effect.  According  to 
eq.  (3),  eq.  (6)  can  be  rewritten  to  directly  relate 
/12(A)  to  the  third-order  polarizability.  A,  for 
weakly-coupled  molecules  randomly  oriented  in 
a  given  medium: 

/i2(A)=  ^,Tr-A<y(A))/.2(A)/(c/i„(A)-).  (7) 

2.3.  Calculation  of  third-order  molecular 
polarizabilities 

Only  two  types  of  third-order  molecular 
polarizabilities  are  calculated  here,  the  ir -elec¬ 
tronic  component  of  the  dynamic  third-order 
polarizability  for  third-harmonic  generation, 
'y„(-3a/;  w,  a>,  w),  and  the  static  third-order 
polarizability,  7^(0;  0, 0, 0).  The  calculation  of 
■y„(-3w;  a>,  a»,  oi)  and  y„(0;  0,0,0)  for  a  given 
molecular  unit  proceeds  in  three  steps: 

(1)  determination  of  a  molecular  geometry, 

(2)  calculation  of  the  transition  energies,  perma¬ 
nent  state  dipole  moments,  and  transition  dipole 
moments  associated  with  the  ground  and  excited 
TT-electron  states  for  this  geometry,  and 

(3)  evaluation  of  the  sum-over-states  expression 
for  the  y^’s  using  these  transition  energies  and 
dipole  moments. 

The  TT-electron  bonding  networks  in  all  the 
molecules  in  fig.  1  were  kept  planar.  The  bond- 
lengths  and  bond-angles  used  for  these  molecules 
are  given  in  ref.  [22].  It  is  noted  that  the  tt- 
electron  polarizabilities  and  spectroscopic  prop¬ 


erties  of  the  molecules  are  not  very  sensitive  to 
the  lengths  of  the  N-H  and  C-H  bonds. 

The  INDO-SDCI  method  employed  in  the 
calculation  of  the  tt -electron  transition  energies, 
permanent  state  dipole  moments,  and  transition 
dipole  moments  for  the  various  molecules  is  the 
same  as  that  described  in  ref.  [21],  with  one 
exception.  This  exception  is  that  the  transition 
energies  were  calculated  using  the  formula  [26], 
^,(5/N)£i,,  where  E,  is  the  energy  of 
excited  state  /,  £„  is  the  ground  electronic  state 
energy,  and  N  is  the  number  of  conjugated 
ir-molecular  orbitals.  This  formula  scales  the 
ground  state  energy  to  compensate  for  the  imbal¬ 
ance  in  the  degree  to  which  the  ground  state  and 
the  'wir*  states  are  stabilized  by  single-  and 
double-excitation  interaction  [26]. 

Our  evaluation  of  the  sum-over-states  expres¬ 
sions  for  y„(— 3w;  a>,  to,  a>)  and  y„(0;0,0,0), 
and  the  isotropic  averages  of  these 
polarizabilities,  (y„(-3to;  to,  to,  to))  and 
(y,(0; 0,0,0)),  are  as  given  in  ref.  [21].  Unless 
indicated  otherwise,  y„(-3to;  to,  to,  to)  and 
y„(0; 0,0,0))  will  henceforth  be  taken  to  be 
synonymous  with  (y„(-3to;  to,  to,  to))  and 
(y„(0; 0, 0, 0).  The  jr-axis  of  the  molecule-fixed 
coordinate  systems  for  the  linear  molecules  in 
fig.  1  is  taken  to  be  coincident  with  a  line 
connecting  the  two  terminal  atoms  of  the  tt- 
electron  bonding  network.  As  a  result,  the  y,,„ 
component  is  expected  to  dominant  the  y^ 
tensor. 


3.  Results  and  discussion 

3.1.  Comparison  of  calculated  and  measured 
y„  (-3(0;  to.  to,  to)’s  for  linear  streptocyanines 

Our  theoretical  method  has  been  successful  in 
calculating  nonresonant  y^(-2to;  to,  to,  0)’s  for 
ethylene,  linear  polyenes,  and  benzene  that  are 
in  very  good  agreement  with  the  y’s  measured 
for  these  molecules  in  the  gas  phase  [21].  A 
similar  comparison  of  calculated  and  measured 
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■y’s  is  given  in  table  1  for  three  linear  strep- 
tocyanine  cations  of  different  lengths.  The  calcu¬ 
lated  values  of  and  y  in  table  1  are  rounded- 
up  to  the  most  significant  digit.  This  table  con¬ 
tains  the  y(-3o>;  a>,  a>)’s  measured  at  w  = 

1.908  jim  for  the  three  molecules  in  the  liquid 
phase  [15],  and  the  to,  to,  toys  calculated 

for  the  streptocyanines  with  both  symmetric  and 
asymmetric  geometries.  The  y{-3a>;  a>,  to,  tt>)’s 
for  the  two  geometries  were  calculated  since  it  is 
unclear  which  geometry  was  stabilized  by  the  Cl" 
counterion  and  the  solvent  used  in  the  ex¬ 
perimental  samples.  In  this  regard,  Fabian  and 
Mehlhorn  [27]  comment  that  the  asymmetric 
geometries  seen  experimentally  [28]  in  crystals  of 
streptocyanines  may  be  due  to  a  non-symmetric 
arrangement  of  the  counterion  to  the  chain  in 
the  crystal.  Note  that  the  calculated 
y{-3(o-,  (o,  to,  4<j)’s  in  table  1  are  separated  into 
O'-  and  TT-electron  components.  The  y^  term  was 
estimated  using  the  bond-additivity  approxi¬ 
mation  described  in  ref.  [21],  while  the  y„  term 
was  that  calculated  for  the  unmethylated 
cyanines. 

As  shown  in  table  1,  the  magnitudes  of  the 
measured  -y’s  and  those  calculated  for  either  the 
symmetric  or  asymmetric  streptocyanines  are  in 
rough  agreement.  The  positive  signs  of  the  -y’s 
measured  for  the  two  shorter  streptocyanines  can 
be  explained  in  terms  of  the  stabilization  of  the 
symmetric  geometries  by  the  Cl  ~  counterion  and 
solvent,  while  the  negative  sign  of  the  y  mea¬ 


sured  for  the  longest  streptocyanine  can  be  ex¬ 
plained  in  terms  of  the  stabilization  of  the  asym¬ 
metric  geometry.  This  interpretation  of  the  mea¬ 
sured  data  needs  to  be  investigated  further. 
Recent  measurements  of  non-resonant  third- 
harmonic  generation  in  liquid  solutions  contain¬ 
ing  cyanine  molecules  confirm  the  above  theoret¬ 
ical  prediction  that  the  sign  of  y  for  these  mole¬ 
cules  can  be  negative,  and  that  the  sign  and 
magnitude  of  y  should  be  very  sensitive  to  the 
dielectric  properties  of  the  solvent  [29]. 

3.2.  Comparison  of  calculated  y„(0;  0,  0,  0)'s 

The  |'y„(0;  0, 0, 0)|’s  calculated  for  the  five  sets 
of  molecules  in  fig.  1  are  plotted  on  a  logarithmic 
scale  in  fig.  2  as  functions  of  the  number  of 
IT -electrons  in  each  molecule.  The  sym¬ 

metric,  linear  streptocyanines  with  no  bond- 
length  alternation  are  designated  LCl  in  fig.  2; 
LC2  is  the  label  for  the  symmetric,  linear 
cyanines  with  no  bond-length  alternation;  LC3 
represents  the  asymmetric  linear  cyanines  with 
bond-length  alternation;  LPl  designates  the 
linear  polyenes  with  bond-length  alternation; 
and  LP2  represents  the  linear  polyenes  with  no 
bond-length  alternation.  A  dashed  line  in  fig.  2 
indicates  a  negative  7^(0;  0,0,0),  and  a  solid 
line  a  positive  y„(0;  0,0,0). 

In  regard  to  the  sign  of  ■y„(0;  0,0,0)  for  the 
different  molecules,  the  symmetric  linear 
cyanines  with  and  without  terminal  methyl 


Table  1 

Calculated  and  measured  -y's  for  selected  linear  strep¬ 
tocyanines. 

Molecule  Theory  ( 1 0 "  esu)  Expt  ( 10  ‘  esu) 


2(Me)  N  (M8)2 

2(Me)N'''‘==^-•'-^N  (M9)2 

2(M«)N?’"i44|:N(Me)2 

2(Me)N--=<>^')^N(Me)2 

2(M«)N^"4^ — )^N(Me)2 


■y.,  y,  y 


2.8 

2.8 

3.3 

3.3 

3.7 


-6 

-1 

-80 

40 

-800 


-3 

2 

-80 

40 

-800 


0.5-3  ±0.4 


52  ±  1 


-510  +  50 


NUliieER  OF  x-ELECTRONS 


Fig.  2.  Calculated  l-y„(0;0.  0,  0)l’s  for  the  five  sets  of  mole¬ 
cules  shown  in  fig.  1. 


3.7  1000  1000 
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groups  all  have  negative  ^^(O;  0, 0, 0)’s;  the 
asymmetric  cyanines  switch  from  negative  to 
positive  signs  after  =  8,  or  as  the  chain 
length  increases  past  hve  carbon  atoms;  and  both 
sets  of  linear  polyenes  have  positive 
y„(0;0,0,0)’s. 

In  regard  to  the  relative  magnitudes  of 
7^(0;  0,0,0)  for  the  different  molecules,  the 
|'y„(0;  0, 0, 0)j's  for  the  symmetric  linear  cyanines 
start  out  less  than  those  for  the  two  sets  of  linear 
polyenes,  but  become  much  greater  than 
|7'„(LP1)1  and  |y„(LP2)|  after  =6.  This  be¬ 
havior  reflects  the  greater  power  law  dependence 
of  the  symmetric  cyanines  on  than  that  of 
the  linear  polyenes,  i.e.,  |-y„(LCl)j « 
|y„(LC2)|  a  Nil  |y.(LPl)l  -  and 

ly„(LP2)|  «  Nil  where  =  4,  6,  8,  10  for  the 
polyenes  and  N„.,  =  4,  6,  8,  10,  12  for  the 
cyanines.  Terminal  methyl  group  substitution  for 
the  symmetric  linear  cyanines  results  in  a  slight 
increase  in  |'y„(0;  0, 0, 0)|  at  each  value  of  N„_^. 

Decreasing  bond-length  alternation  for  the 
linear  polyenes  acts  to  increase  |y„(0;0, 0, 0)|  at 
each  value  of  This  result  is  reasonable  since 
a  lower  degree  of  bond-length  alternation  will 
smooth  out  the  potential  energy  surface  of  the 
TT-electrons  along  the  polyene  chain. 

Perhaps  the  most  interesting  result  shown  in 
fig.  2  is  the  dramatic  effect  that  molecular  geom¬ 
etry  has  upon  the  ^^(O;  0, 0, 0)’s  for  the  linear 
cyanine  cations.  Moving  between  the  symmetric 
geometry  with  no  bond-length  alternation  and 
the  asymmetric  geometry  with  bond-length  alter¬ 
nation  can  result  in  large  changes  in  the  sign 
and/or  magnitude  of  -y„(0;  0, 0, 0).  These 
changes  in  geometry  can  be  effected  by  altering 
the  dielectric  properties  of  the  environment  of 
the  molecule,  e.g.,  variation  in  the  polarity  of 
the  solvent  in  which  the  molecule  is  dissolved. 


4.  Summary  and  conclusions 

The  TT -electron,  third-order  polarizabilities  for 
a  set  of  polymethines  and  a  set  of  polyenes  were 
calculated  and  analyzed.  The  set  of  polymethines 


comprised  a\l-trans,  linear  symmetric  cyanine 
and  streptocyanine  cations  with  no  double-single 
bond-length  alternation,  and  cyanine  cations 
with  asymmetric  geometries  resulting  from  the 
artificial  imposition  of  double-single  bond-length 
alternation.  The  set  of  polyenes  consisted  of 
all-rroAis  linear  polyenes  with  double-single  bond- 
length  alternation,  and  a.l\-trans  linear  polyenes 
with  geometries  resulting  from  the  artificial  im¬ 
position  of  no  double-single  bond-length  alterna¬ 
tion.  The  theoretical  analysis  consisted  of  two 
parts:  (1)  a  comparison  of  the  calculated  and 
measured  y{-3(o;  to,  to,  ft>)’s  for  the  linear  strep- 
tocyanines  to  extend  the  validation  of  our  theo¬ 
retical  method  beyond  that  of  the  linear  polyenes 
and  benzene  [21],  and  (2)  a  comparison  of  the 
calculated  'y„(-0;  0, 0, 0)’s  that  emphasizes 
analyzing  the  effect  of  terminal  methyl  group 
substitution  on  the  y„’s  for  the  symmetric 
cyanine  cations,  the  effect  of  bond-length  alter¬ 
nation  on  the  signs  and  magnitudes  of  the  y„’s 
for  the  linear  cyanines  and  polyenes,  and  the 
dependence  of  |y„|  on  the  number  of  ir-electrons 
in  both  sets  of  molecules. 

The  y„’s  of  the  linear,  symmetric  and  asym¬ 
metric  cyanines  are  calculated  to  be  substantially 
different  from  those  of  the  linear  polyenes.  In 
regard  to  the  sign  of  y„(0;  0, 0, 0),  the  symmetric 
linear  cyanines  with  and  without  terminal  methyl 
groups  all  have  negative  y„(0;  0, 0, 0)’s;  the 
asymmetric  cyanines  switch  from  negative  to 
positive  signs  as  the  chain  length  increases  past 
five  carbon  atoms;  while  both  sets  of  linear 
polyenes  have  positive  y„(0;  0, 0, 0)’s.  In  regard 
to  the  magnitude  of  y„(0;  0, 0, 0),  the 
|y„(0;  0, 0, 0)|’s  for  the  symmetric  linear  cyanines 
start  out  less  than  those  for  the  two  sets  of  linear 
polyenes,  but  become  much  greater  after  the 
number  of  7r-electrons  exceeds  six.  This 

behavior  reflects  the  greater  power  law  depen¬ 
dence  of  the  symmetric  cyanines  on  than 
that  of  the  linear  polyenes,  i.e.,  |y„ (symmetric 
cyanines)!  a  =  4,  6,  8,  10,  12  and  ly„(linear 
polyenes)!  =  4,  6,  8,  10.  The  polymethines 

in  general,  and  the  cyanines  in  particular,  com¬ 
prise  a  very  promising  class  of  nonlinear  optical. 


58 


B.M.  Pierce  /  Linear  conjugated  tr -electro  i  molecules 


■n -electron  molecules  that  merit  further  ex¬ 
perimental  study. 
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An  improved  version  of  the  Davydov  soliton  model  to  describe  the  physical  mechanisms  of  the  energy  transfer  in 
proteins  has  been  introduced.  The  essential  modifications  have  been  done  for  the  lattice  subsystem:  (1)  the  intrapeptide 
N — H  bond  is  allowed  to  move  together  with  its  peptide  group  and  therefore  both  the  acoustic  and  optical  modes  appear  to 
be  coupled  and  (2)  a  harmonic  on-site  potential,  which  describes  the  influence  of  the  environment  of  'he  chain,  is 
introduced.  The  chain  is  considered  to  be  in  contact  with  a  heat  bath  only  via  the  acoustic  mode,  while  the  coupling  of  the 
amide-I  excitation  with  the  optical  mode  is  shown  to  be  crucial  for  the  thermal  stability  of  the  Davydov  soliton.  The 
presence  of  a  substrate  on-site  potential,  modelling  the  influence  of  the  environment,  is  shown  to  improve  the  soliton 
stability  with  respect  to  thermal  oscillations  since  it  essentially  reduces  the  amplitude  of  acoustic  oscillations. 


1.  Introduction 

The  study  of  the  mechanisms  for  energy  and 
charge  transport  in  biological  systems  represents 
one  of  the  important  aspects  of  bioenergetics.  In 
1973,  Davydov  suggested  a  soliton  mechanism 
[1]  to  explain  the  efficient  transfer  of  the  vi¬ 
brational  amide-I  energy  in  proteins.  According 
to  his  model,  solitary  waves  (or  solitons)  are 
carriers  of  the  intramolecular  high-frequency 
vibrational  amide-I  energy  which  can  be  excited 
inside  each  peptide  group  (PG)  H — N — C=0 
involving  mainly  longitudinal  oscillations  of  the 
C=0  bond  (for  a  comprehensive  review  see 
Scott  [2]).  The  formation  mechanism  of  the 
Davydov  soliton  is  based  on  the  balance  between 
the  effects  of  dispersion  caused  by  the  resonance 
interaction  of  adjacent  PGs  and  the  interaction 
of  the  amide-I  mode  with  longitudinal  deforma¬ 
tion  (acoustic  mode)  of  the  chain  (self-trapping 
mechanism). 

The  possibility  of  the  successful  application  of 
the  concept  of  the  Davydov  soliton  in  biology 


depends  on  its  stability  at  physiological  tempera¬ 
tures,  i.e.  at  T^SOOK.  Therefore  its  thermal 
stability  has  recently  been  the  subject  of  inten¬ 
sive  studies  [3-10].  Thus,  for  instance,  the  nu¬ 
merical  simulations  [4,5]  on  the  basis  of  the 
Langevin  approach  discover  the  thermal  de- 
stabilization  of  the  Davydov  soliton  at  room 
temperatures.  As  follows  from  these  calcula¬ 
tions,  the  amplitude  of  thermal  vibrations  of  PGs 
exceeds  more  than  one  order  the  amplitude  of  a 
localized  deformation  of  the  chain.  In  this  case, 
thermal  vibrations  result  in  the  break  of  the 
coupling  of  the  amide-I  excitation  with  the  local 
deformation  of  the  chain  and  as  result,  the 
soliton  (self-trapping)  state  disappears. 

However,  the  interaction  of  the  intramolecular 
amide-I  excitation  with  other  low-frequency  in¬ 
tramolecular  vibrations  should  improve  the 
soliton  stability.  The  period  of  such  optical 
vibrations,  compared  to  acoustic  oscillations,  is 
much  less  than  the  time  for  the  excitation  to  be 
transferred  from  one  PG  to  its  neighbor.  The 
excitation  is  sensitive  to  these  optical  oscillations 
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in  an  averaged  way,  responding  only  on  the 
motion  of  their  equilibrium  positions.  Therefore, 
in  contrast  to  thermal  acoustic  oscillations,  ther¬ 
mal  optical  vibrations  should  not  exert  any 
visible  destabilization  influence  on  the  Davydov 
soliton. 

In  this  paper  the  dynamics  of  the  self-trapping 
state  of  the  amide-I  excitation  in  a  thermalized 
chain  is  studied  provided  that  it  interacts  with 
both  the  acoustic  and  optical  modes.  The  inter¬ 
action  of  the  chain  with  a  heat  bath  is  assumed  to 
be  described  by  the  Lange vin-type  equation.  The 
phenomenological  damping  and  the  color  noise, 
which  model  the  interaction  of  the  chain  with  the 
bath,  are  involved  only  into  the  acoustic 
subsystem,  while  the  optical  subsystem  is  sup¬ 
posed  to  be  thermalized  via  its  coupling  with 
acoustic  oscillations.  The  interaction  of  the 
amino  acids  with  its  local  environment  chosen  in 
the  form  of  a  harmonic  on-site  potential  [11]  is 
also  included  into  our  studies  of  temperature 
effects. 

The  best  appropriate  material  which  may  be 
described  on  the  basis  of  this  model  is  the  crys¬ 
talline  acetanilide  (ACN)  (GHjCONHC^Hj)^ 
with  parameter  values  very  close  to  those  of 
polypeptide  protein  chains  [12,13].  Experimen¬ 
tally,  in  the  crystalline  ACN  the  amide-I  excita¬ 
tion  is  observed  as  the  maximum  of  the  infrared 
absorption  1667  cm” ‘.  Besides  this  band  in  the 
infrared  and  Raman  spectra  for  ACN,  another 
band  with  the  red  shift  about  17  cm  ” '  has  been 
found  in  the  experiments.  The  suggestion  has 
been  done  that  this  band  is  caused  by  the 
interaction  of  the  amide-I  with  an  intrapeptide 
displacement  of  the  proton  in  the  N — H  bond 
and  it  may  be  identified  with  Davydov-like 
soliton  states  stabilized  by  this  interaction.  Thus, 
to  describe  the  soliton  transport  of  the  amide-I 
energy  along  the  chain  of  hydrogen-bonded  PGs 
both  the  couplings  of  this  excitation  with  the 
deformation  of  the  hydrogen  bonds  and  with  the 
intrapeptide  proton  displacements  should  be 
considered. 


2.  A  modified  Davydov  model 

Consider  a  typical  one-dimensional  chain  of  N 
hydrogen-bonded  peptide  groups.  Suppose  that 
along  the  axis  x  =  nl,  where  /  is  the  lattice 
spacing,  PGs  are  placed  in  the  sites  with  n  = 
I, ...  ,N.  Besides  the  longitudinal  displacements 
of  PGs  as  whole  objects,  the  internal  motion  of 
the  proton  in  the  N — H  bond  is  also  considered. 
Schematicc '.ly  this  lattice  model  is  shown  in 
fig.  1. 

The  standard  quantum  mechanical  description 
of  this  model  can  be  reduced  in  the  adiabatic 
approximation  to  the  classical  Lagrangian 

^  ~  ■>  y  n’’  y  n} 

=  +  (1) 

composed  of  three  components.  The  first  of 
these,  the  exciton  Lagrangian,  is 

n=  I 

+  (2) 

where  the  dot  denotes  the  differentiation  with 
respect  to  time  t,  E(,  is  the  amide-I  site  energy 
(Eq  =  0.205  eV),  -J  is  the  nearest  neighbor 
dipole-dipole  coupling  energy  and  \<f>„\^  defines 
the  probability  to  find  a  quantum  of  the  amide-I 
excitation  at  the  nth  PG,  so  that  the  discrete 
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Fig.  1.  Schematical  representation  of  the  chain  mode!  of 
hydrogen-bonded  PGs. 
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wave  function  is  supposed  to  be  normalized 

as  =  1- 

The  second  term,  the  lattice  Lagrangian,  in¬ 
cludes  two  modes:  longitudinal  displacements  of 
PGs  as  whole  objects  (acoustic  mode)  and  in¬ 
trapeptide  optical  oscillations  of  the  proton  in 
the  N — H  bond  with  low  frequencies  as  com¬ 
pared  with  the  frequency  of  the  amide-I  oscil¬ 
lators.  In  comparison  with  the  previous  studies 
[4,5],  the  following  two  essential  modifications 
are  made  for  this  chain  model.  First,  the  N — H 
bond  is  assumed  to  move  together  with  its  PG 
and  therefore  both  the  acoustic  and  optical 
modes  come  to  be  coupled.  Second,  a  substrate 
on-site  potential  with  single-minimum  topology 
is  introduced  [11]  which  models  the  interaction 
of  the  chain  with  its  environment  as  shown  in  fig. 
1.  Thus, 

V 

=  S  +  -yj 

(3) 

where  m  and  M  are  reduced  masses  of  the 
proton  and  PG.  respectively;  x„  is  the  proton 
displacement  in  the  N — H  bond  (with  the  stiff¬ 
ness  constant  K)  of  the  nth  PG  and  y„  is  the 
longitudinal  displacement  of  this  PG;  the  Morse 
potential  V{p)  =  “O'  describes  the  cou¬ 

pling  between  adjacent  PGs  and  is  the  energy 
of  the  hydrogen  bond.  For  small  deformations  of 
this  bond  K(p)  =  4kp'  where  k  =  F"(0)  is  the 
stiffness  constant  of  the  hydrogen  bond.  The 
phenomenological  parameter  b  for  the  Morse 
potential  can  be  calculated  according  to  the 
relation  b  =  sJkHe^.  The  last  term  describes  the 
interaction  of  the  nth  PG  with  the  substrate 
is  the  stiffness  constant). 

The  interaction  part  of  the  Lagrangian  de¬ 
scribes  the  coupling  of  the  amide-I  with  both  the 
acoustic  and  optical  modes  and  it  is  written  in 
the  standard  form 

v 

=  S  [x^ix„-y„)  +  X2{yn-yn^l)]<t>*n<t>n  ^  (4) 

n  =  1 


where  and  are  the  coupling  constants  of  the 
amide-I  with  deformation  of  the  N — H  bond  and 
of  the  hydrogen  bond,  respectively. 

Then  the  equations  of  motion  which  corre¬ 
spond  to  the  Lagrangian  (l)-(4)  become 

ih^„  =  [£o  +  Ari(>’„  -  -f  J  +  Ar2(>'«-i  - 

-  .  (5) 

mx„  =  K(y„-x^)-i-xM’ ,  (6) 

My„  =  V\y„^,-y„)-V\y„-y„_,) 

-K{y„-x„)-Kj„-xM' 

-rMy„  +  at).  (7) 

where  the  prime  denotes  the  differentiation  of 
the  function  V  with  respect  of  its  argument.  Here 
eq.  (7)  has  the  form  of  the  Langevin  equation 
describing  the  interaction  of  PGs  via  the  amino 
acid  residues  with  the  thermal  bath.  As  for  the 
N — H  optical  oscillations,  they  are  thermalized 
via  thermal  motions  of  PGs.  The  thermal  bath 
with  temperature  T  is  described  by  the  relaxa¬ 
tion  time  {,=  lir  and  the  normally  distributed 
random  forces  („{i)  with  the  correlation  function 
<C(0  =  2MrksTil/{t  -  where  the 

autocorrelation  function  <p(r)  is  normalized  as 
jr*  iA(f)  dr  =  1.  Using  a  color  noise  in  eq.  (7)  we 
are  able  to  model  the  “whole"  thermalization  of 
acoustic  modes  and  a  “partial”  thermalization  of 
optical  modes.  Since  some  analytical  work  can 
be  done  for  the  exponential  color  noise  (<A(r)  = 
exp(-|rl/rj/2rj,  it  is  convenient  to  use  it  for  our 
purposes  and  in  this  case  we  may  complete  the 
set  of  eqs.  (5)-(7)  by  the  equation 

L=t~v'[m-v„{n].  (8) 

where  the  t7„’s  are  the  delta-correlated  random 
forces  (white  noise),  so  that  {yimiOVnU'))  ~ 
2Mrk^T8it-nS„„. 
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3.  Numerical  simulations  of  the  soliton 
dynamics 

For  numerical  studies  of  the  soliton  model 
we  have  considered  the  chain  of  N  =  50  PGs 
with  periodic  boundary  conditions  and  the 
following  parameter  values:  7  =  1.55  x  10  "  J, 
£„  =  0.17eV=2.72x  10-'’J,  m  =  m  =  1.67x 
10“-' kg,  Af  =  113.2mp  =  1.89X  lO'-^'kg,  k  = 
13  N/m,  I  =  4.5  A.  Calculations  for  the  acoustic 
and  optical  modes  with  K  =  30.5  N/m  and  = 
4  N/m  give  in  the  linear  limit  the  following 
frequency  bands:  24.3 cm“‘ < < 90.9 cm”' 
and  719.86cm“' </}„p<719.91cm“'.  The  Eins¬ 
tein  temperatures  can  also  be  calculated  and  we 
have  130.7  K  for  the  acoustic  mode  and  1035.8  K 
for  the  optical  mode. 

The  damping  coefficient  is  determined  by  the 
relation  F  =  6TTbrj/ M  where  =  1  10  A  is  the 

characteristic  linear  size  of  an  amino  acid  residue 
and  Ti  is  the  viscosity  of  medium.  For  water 
Tj  =  10”' kg/m  s,  therefore  r  =  (l-10)x 
10"s'‘.  We  choose  the  minimal  value  r  = 
10''cm”‘  which  corresponds  to  the  relaxation 
time  /,  =  0. 1  ps. 

First,  we  study  pure  soliton  solutions  in  the 
unthermalized  chain  by  using  the  steepest-des- 
cent  method  [11].  We  write 

-  vt)  exp{i[kn  -  (£„  +  A)r/A]}  ,  (9) 

where  is  a  smooth  real  function,  v  is  the 
soliton  velocity,  k  is  the  wave  number,  and  A  is  a 
real  parameter.  Substituting  (9)  into  eq.  (5)  we 
find  the  soliton  velocity  spectrum:  u  =  sin  k 
where  v^^  =  2lJlh  is  the  maximum  velocity  of 
excitons.  Also,  we  derive  the  discrete  stationary 
Schrodinger  equation  with  the  spectral  parame¬ 
ter  A  which  can  be  calculated  by  using  the 
normalization  condition  for  the  envelope  tp^. 
Next,  inserting  (9)  into  the  Lagrangian  (l)-(4) 
and  using  the  approximate  relations  <p„  = 
-  (P„)IL  x„  =  -x„)/l,  and  = 

+  i  -y„)IL  we  obtain  the  conditional  mini¬ 
mum  problem  min :  =  1  in  the  con¬ 
figuration  space  {(p„,x„.  (for  more  details 


see  Zolotaryuk  et  al.  (111).  Starting  the  descent 
from  the  point  corresponding  to  the  localization 
of  the  excitation  at  one  PG  we  obtain  the  set 
describing  a  self-trapping  state 
with  its  localization  width  L  =  (E^=, 

Next,  this  set  is  used  as  initial  conditions  to 
simulate  eqs.  (5)-(7)  taken  without  damping  and 
forcing  terms  [11]. 

As  was  shown  previously  [11],  at  ^"1=0  ^nd 
K^>0  the  soliton  exists  only  for  large  values  of 
X2-  For  its  small  values  there  exists  only  the 
stable  exciton  state  (note  that  at  =  0  the 
Davydov  soliton  is  stable  while  the  exciton  is 
unstable  for  all  values  of  The  descent 
calculations  have  shown  that  in  the  chain  with 

=  0  the  static  soliton  solution  exists  only  if 
X2  ^  0.8  X  10”"’  N  while  for  >0  it  exists  for  all 
the  values  of  Xz-  Besides  this,  with  increasing  X\ 
(at  a  fixed  value  of  Xi)  s^d  X2  ^  fixed  value  of 

),  the  soliton  energy  and  its  width  L  are 
monotonically  decreased.  Also,  with  increasing 
the  soliton  velocity  u,  the  soliton  energy  is 
increased  while  the  width  L  is  decreased. 

The  motion  of  the  amide-I  excitation  in  the 
thermalized  cyclic  chain  is  governed  by  the 
system  of  equations  (5)-(8).  The  dynamical 
simulations  have  been  carried  out  for  the  chain 
of  A  =  50  PGs  with  the  value  Xi  =  1  ^  10”"*  N  to 
be  fixed  and  Xi  to  be  varied.  The  amplitude  of 
thermal  oscillations  for  the  acoustic  sublattice 
exceeds  by  one  order  the  deformation  amplitude 
caused  by  the  amide-I  excitation.  On  the  back¬ 
ground  of  these  oscillations  a  local  deformation 
of  the  acoustic  sublattice  is  not  sensible  for  the 
excitation  and  the  self-trapping  state  has  to  be 
stablilized  only  by  the  coupling  of  the  excitation 
with  a  local  deformation  of  the  optical  sublattice. 
Therefore  the  initial  conditions  for  the  set  of 
equations  (5)-(7)  which  correspond  to  a  local¬ 
ized  self-trapping  state  can  be  chosen  as 
follows:  <^„(0)  =  tpl  e’*",  x„(0)  =  £„  +  x”,  y„(0)  = 
yn+y'n^  x„{0)  =  x„-v{xl^^  -jr")//,y„(0)  =  y„  - 

^{yl^i-y:)ii^  =  L  n  =  i,  2,  .  .  .  ,  A, 

where  v  is  the  soliton  velocity,  k  =  arcsin(i;/t;gj, 
{<p”,  Jt",  y“}^=,  is  the  “pure”  soliton  solution  of 
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the  minimization  problem,  and  {x„,  y„,  x„,  y„, 
is  a  “lattice”  solution  of  eqs.  (6)-(8)  at 
the  instant  of  time  r  =  50  ps,  when  the  lattice  is 
already  thermalized,  i.e.,  it  is  found  in  the 
thermal  equilibrium  with  the  heat  bath. 

Take  at  the  initial  instant  of  time  the  static 
soliton  solution  (u  =  0).  In  the  chain  with  ^^2  =  0 
the  soliton  is  stable  against  to  the  thermal 
oscillations  of  PGs  with  any  values  of  the  corre¬ 
lation  time  For  the  values  r^  =  0.1;  0.07; 
0.04  ps  the  soliton  width  practically  is  not 
changed  while  for  the  values  /^  =  0.01;  0.001  ps 
its  insignificant  oscillations  are  observed  (fig.  2). 
This  can  be  explained  if  we  notice  that  the 
period  of  the  optical  oscillations  (~0.046ps)  is 
less  by  one  order  than  the  minimum  time  which 
the  excitation  needs  to  be  transferred  from  one 
unit  cell  of  the  chain  to  its  neighbor  - 
0.34  ps).  The  intramolecular  excitation  is  sensi¬ 
tive  to  such  oscillations  only  in  the  averaged 
way,  responding  only  on  the  change  of  the 
centers  of  vibrations. 

In  the  chain  with  '"N,  thermal 

vibrations  lead  only  to  insignificant  oscillations  of 
the  soliton  width  (fig.  3)  while  the  strong  inter¬ 
action  of  the  excitation  with  the  acoustic 
subsystem  (jfz  “  ^  10“"  N)  results  in  the  decay 
of  the  soliton  (fig.  4).  This  can  also  be  explained 
if  we  note  that  the  period  of  the  acoustic  oscilla¬ 
tions  has  the  same  order  as  the  time  for  the 
excitation  to  propagate  along  the  chain  one  unit 
cell.  In  this  case,  the  excitation  cannot  follow  the 
large-amplitude  deformation  caused  by  thermal 


Fig.  2.  Time  dependence  of  the  soliton  width  Hv  =  0)  in  the 
thermalized  chain  with  =  1  x  10  N  and  =  0  at  =  0. 1 ; 
0.07;  0.04;  0.01;  0.001  ps  (curves  1,  2,  3,  4,  5,  respectively). 


Fig.  3.  Time  dependence  of  the  soliton  width  L  (u  =  0)  in  the 
thermalized  chain  with  ;y,  =  1  x  10’'“N  and  =  1  x  10"'^  at 
f,.  =  0.1;  0.07;  0.04;  0.01;  0.001  ps  (curves  1,  2,  3,  4,  5, 
respectively). 


Fig.  4.  Time  dependence  of  the  localization  width  L  for  the 
static  self-trapping  state  in  the  thermalized  chain  with  ;ri  = 
1  X  10  ‘“N  and  .r,  =  4x  10'"  N  at  /,  =  0.1;  0.07;  0.04;  0.1; 
0.001  ps  (curves  1.  2,  3,  4,  5.  respectively). 

oscillations  of  the  molecules  which  appear  ran¬ 
domly  along  the  chain. 

4.  Concluding  remarks 

To  study  the  thermal  stability  of  the  Davydov 
soliton  at  biological  temperatures,  we  have  con¬ 
sidered  the  model  with  the  interaction  of  the 
amide-I  excitation  with  both  the  acoustic  and 
optical  modes.  Since  the  intrapeptide  N — H 
bonds  are  allowed  to  move  together  with  their 
PGs,  the  acoustic  and  optical  oscillations  beconie 
to  be  coupled.  Therefore,  the  optical  oscillations 
can  be  thermalized  via  the  acoustic  mode  for 
which  the  temperature  description  is  introduced 
in  the  standard  Langevin  form. 

The  dynamical  semiclassical  simulations  of  the 
time  evolution  of  the  intrapeptide  amide-I  exci¬ 
tation  have  shown  that  thermally  stable  soliton 
(self-trapping)  states  can  exist  if  they  are  stabil- 
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ized  by  the  coupling  of  the  excitation  with  the 
intrapeptide  displacements  (optical  mode)  while 
the  coupling  with  deformation  of  hydrogen 
bonds  (acoustic  mode)  is  shown  to  have  de¬ 
stabilizing  influence.  At  the  values  Xi  ~  (^~5)  x 
10~“N  given  by  quantum-mechanical  calcula¬ 
tions  (for  references  see  Scott  [2])  the  soliton 
appears  to  be  unstable  with  respect  to  thermal 
oscillations.  Therefore,  on  the  surface  of  a 
globular  protein,  where  thermal  oscillations  of 
hydrogen  bonds  have  large  amplitudes,  the 
soliton  cannot  survive  at  room  temperatures.  On 
the  other  hand,  globular  proteins  are  arranged 
with  high  density  and  therefore  the  stiffness  of  a 
substrate  inside  a  globule  can  significantly  exceed 
the  value  ATg  =  4  N/m.  In  this  case  the  frequency 
of  acoustic  vibrations  is  increased  while  their 
amplitude  is  decreased  by  one  order  (~0.01  A). 
At  these  conditions  the  soliton  becomes  stable 
with  respect  to  the  thermal  oscillations.  Thus, 
the  on-site  potential,  which  models  substrate 
influence  [11],  plays  an  important  role  for  the 
thermal  stability  of  the  Davydov  soliton. 
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The  thermal  stability  of  soliton  solutions  of  the  semiclassical  Davydov  model  for  energy  transfer  in  proteins  is 
investigated.  Thermal  averaging  is  made  taking  the  degeneracy  of  the  states  into  account.  It  is  shown  that  Langelin 
dynamics  leads  to  the  same  average  states  as  the  semiclassical  Monte  Carlo  simulations,  indicating  that  the  former  do 
constitute  an  accurate  representation  of  the  thermal  bath  in  the  semiclassical  approximation. 


1.  Introduction 

In  1973,  Davydov  [1]  proposed  a  mecha¬ 
nism  for  energy  transfer  in  proteins,  according 
to  which  the  energy  released  in  the  hydrolysis 
of  adenosine  triphosphate  can  propagate  along 
an  enzyme  a-helix  in  the  form  of  a  soliton.  It  is 
now  well  established  that  the  Davydov  Hamil¬ 
tonian  does  possess  soliton  solutions  at  T =0K. 
An  essential  question  is,  however,  whether  at 
physiological  temperatures  the  Davydov  soliton 
lasts  long  enough  to  have  a  biological  role  [2- 
4].  Simulations  with  Langevin  equations  lead 
some  authors  [2,3]  to  the  conclusion  that,  at 
7’=310K,  the  Davydov  soliton  disappears  in 
a  few  picoseconds.  This  conclusion  is  depen¬ 
dent  on  an  accurate  simulation  of  the  effects 
of  the  thermal  bath.  Recently  a  test  to  measure 
the  accuracy  of  the  thermal  bath  representation 
has  been  proposed  [6].  It  aims  at  comparing 
average  results  from  the  Langevin  dynamics 
(LD)  with  corresponding  results  from  the  exact 
Monte  Carlo  (MC)  simulations.  This  article  is 
concerned  with  the  results  of  this  test. 


2.  The  semiclassical  Davydov  model 

The  Hamiltonian  used  here,  designated  as  the 
semiclassical  Davydov  Hamiltonian  H,  is,  for  a 


one-dimensional  lattice  with  N  sites, 

W  +  //ph  +  Ant,  (1) 

where  Ax  is  the  exciton  Hamiltonian,  //ph  is  the 
phonon  Hamiltonian  and  Ant  is  the  interaction 
Hamiltonian.  The  exciton  Hamiltonian  Ax  is 

Aex  =  53  +  2j2„+i)], 

^=1,^ 

(2) 

where  e  is  the  intramolecular  excitation  energy, 
-J  is  the  dipole-dipole  interaction  energy  be¬ 
tween  neighbouring  sites  and  (a„ )  is  the  cre¬ 
ation  (annihilation)  operator  for  an  excitation 
in  site  n.  The  interaction  Hamiltonian  Am  is 

Ant=  X]  lx'*'  iu„+i-u„)ala„ 

n  =  l,N 

+x~  iHn-  (3) 

where  x'^ix~)  is  an  anharmonic  parameter  re¬ 
lated  to  the  coupling  between  the  excitation  and 
the  displacement  u„  of  the  following  (preceding) 
site.  The  Hamiltonian  is  called  semiclassical  be¬ 
cause  the  lattice  is  considered  a  classical  system, 
which  is  here  expressed  by  treating  the  displace¬ 
ments  u„  (and  below  the  momenta  P„ )  of  the 
lattice  as  real  variables,  while  the  intramolecular 
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excitation  is  described  by  the  operators  ai,a„. 
This  difference  is  marked  by  the  hats  above  the 
operators.  Thus,  the  phonon  Hamiltonian  is 


where  P„  is  the  momentum  of  site  n,  M  is  the 
mass  of  each  site  and  k  is  the  elasticity  constant 
of  the  lattice.  The  exact  general  solution  of  the 
Schrodinger  equation  for  the  first  excited  state 
of  the  semiclassical  Davydov  (1-4)  is 

I  \l/{{Un},{Pn),{4>n)j))  = 

E  <t>n({Un),{Pr,},t)ai\Q),  (5) 

n=\,S 

where  no  form  is  set  a  priori  to  4>„.  Exact  mini¬ 
mum  energy  first  excited  states  of  the  semiclas¬ 
sical  Davydov  Hamiltonian  can  be  determined 
by  numerically  minimizing  the  Hamiltonian 
(1-4)  averaged  over  state  (5),  subject  to  the 
normalization  condition  X)i=i,v  I  P=  1- 
In  the  case  7=0,  the  full  quantum  mechanical 
Davydov  model  can  be  solved  exactly  and  it 
has  been  shown  that  the  semiclassical  Davydov 
model  leads  to  the  same  minimum  energy  states 
[7].  For  J  ^0  the  solution  to  the  full  quantum 
model  is  not  known.  In  fig.  1  the  minimum  en¬ 
ergy,  as  determined  numerically  for  the  semi¬ 
classical  Davydov  model  is  compared  with  that 
determined  approximately  by  Venzl  and  Fischer 
[S]  for  the  full  quantum  mechanical  Hamilto¬ 
nian.  Letting ^=;f^K 7,  (with x=X'*'=X  ”)» for¬ 
mula  (39)  of  [5],  p.  6093,  was  used  when  ft  <1 
and  formula  (55)  of  [5],  p.  6094,  was  used 
when  n  >  \.  Fig.  1  shows  that,  also  for  7  ,^0, 
the  semiclassical  Davydov  model  leads  to  ap¬ 
proximately  the  same  states  as  the  full  quantum 
mechanical  model. 


Fig.  1.  Variation  with  x  of  the  energy  of  minimum  en¬ 
ergy  excited  states  determined  with  semiclassical  Davydov 
model  (triangles)  and  with  the  full  quantum  mechanical 
model  (crosses).  N  =  S0,  J  =  1.549x  10“^^  J,  >f=13N/m. 


The  variable  selected  to  compare  the  results 
between  MC  and  LD  simulations  is  the  thermo¬ 
dynamic  average  state,  as  specified  by  the  av¬ 
erage  probability  per  site  n,  ((|  (p„  |^)),  and  the 
corresponding  average  displacement  differences, 
{(u„  -  This  average  state  is  determined 

by  summing  |  and  u„  -  m„_i  over  all  sam¬ 
pled  states,  after  having  located,  for  each  sam¬ 
pled  state,  the  site  for  which  the  probability  of 
excitation  is  maximum  and  translated  this  max¬ 
im  to  the  middle  of  the  chain  and  performing 
tly  the  same  rotation  with  the  displacement 
rerences.  Fig.  2  displays  the  average  state  as 
calculated  from  the  MC  (solid  line)  and  from 
the  LD  (broken  line)  simulations,  respectively, 
for  the  same  values  of  the  parameters  (see  figure 
caption). 

The  MC  simulations  involved  more  than  1 
million  passes  and  the  LD  were  run  for  more 
than  100  nanoseconds.  Fig.  2  shows  that  the  cor¬ 
relation  between  the  site  of  maximum  excitation 
and  the  lattice  deformation  that  was  observed 
in  the  MC  simulations  is  not  destroyed  by  the 
LD.  In  fact,  it  is  present  in  the  LD  with  approxi¬ 
mately  the  same  strength  (i.e.  the  value  of  the  lat- 


3.  Monte  Carlo  simulations  and  Langevin 
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Fig.  2.  Plot  of  the  thermally  averaged  probability  (upper) 
and  displacement  differences  (lower)  determined  from  the 
MC  simulations  (solid  line)  and  the  LD  (broken  line). 
N  =  50.  J  =  1.549  X  10-22  J,  »c=l3N/m,  x  =  62pN. 

tice  deformation  is  the  same)  as  in  the  MC  sim¬ 
ulation.  The  conclusion  is  thus  that  the  LD  does 
produce  an  accurate  representation  of  the  ther¬ 
mal  bath  in  the  semiclassical  Davydov  model. 


as  the  MC  simulations  indicates  that  the  former 
constitutes  indeed  a  faithful  representation  of 
the  thermal  bath.  This  supports  the  conclusion 
that  the  initial  solutions  used  by  other  authors  do 
lead  to  a  soliton  lifetime  of  a  few  picoseconds  at 
310K  [2,3].  These  conclusions  are  valid  within 
the  semiclassical  Davydov  model.  However, 
the  results  in  section  2  indicate  that  they  are 
approximately  valid  for  the  full  quantum  me¬ 
chanical  model.  A  few  points  remain  however, 
to  be  studied.  They  include  the  temperature  de¬ 
pendence  of  the  parameters,  a  decrease  in  7  as 
temperature  increases  will  lead  to  more  stable 
solitons  at  higher  temperatures  and  especially 
the  second  excited  state  has  yet  to  be  explored. 
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4.  Discussion 

The  semiclassical  Davydov  model  can  be 
solved  exactly  and  thus  the  results  presented 
here  do  not  suffer  from  the  ambiguities  that 
plague  the  full  quantum  mechanical  model, 
where  ansatze  for  the  wavefunctions  are  pro¬ 
posed  whose  degree  of  approximation  is  not 
known.  The  MC  simulations  also  provide  an  ex¬ 
act  simulation  of  the  system  at  finite  tempera¬ 
tures.  The  fact  that  LD  leads  to  the  same  results 
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After  a  short  review  of  our  previous  results  on  temperature  effects  on  Davydov  soliton  dynamics,  we  give  some 
comparisons  between  results  obtained  with  Davydov’s  average  Hamiltonian  method  within  the  |D, )  ansatz  state  in  one 
chain  with  quantum  Monte  Carlo  results  found  in  the  literature.  We  find  that  Davydov's  method  leads  to  quantitatively 
incorrect  results,  but  reproduces  the  qualitative  trends  correctly.  Our  results  obtained  with  different  models  point  into  the 
direction  that  Davydov  solitons  should  be  stable,  if  the  value  of  the  spring  constant  for  hydrogen  bonds  is  larger  than 
usually  assumed.  Therefore  we  return  to  Scott’s  suggestion,  that  in  order  to  simulate  the  three  coupled  hydrogen  bonded 
chains  present  in  protein  a-helices  with  one  chain,  the  spring  constant  and  the  mass  should  be  enlarged.  In  case  of  the  |D, ) 
state  we  present  numerical  simulations  for  three  coupled  chains  including  temperature  with  different  initial  states  and  for 
one  chain  using  Scott’s  revised  parameters  at  300  K. 


1.  Introduction 

For  the  mechanism  of  energy  transport 
through  proteins  Davydov  [1,2]  suggested  that 
the  energy  of  =0.4  eV  released  by  hydrolysis  of 
adenosine-triphosphate  (ATP)  could  be  trans¬ 
ported  in  quanta  of  the  amide-I  (mainly  C=0 
stretch)  vibration  (=0.2  eV).  The  CO  groups  par¬ 
ticipate  in  hydrogen  bonds  which  form  chains 
parallel  to  the  axis  of  a-helical  proteins.  Thus 
the  amide-I  vibration  interacts  with  the  acoustic 
phonons  in  these  chains.  The  [1,2]  excitation  of 
an  amide-I  oscillator  causes  a  distortion  in  the 
lattice  which  in  turn  stabilizes  the  amide-I  excita¬ 
tion.  It  was  found  that  for  certain  regions  of  the 
parameter  space  of  the  model  this  effect  can 
prevent  the  excitation  from  dispersion  via  the 
dipole  -  dipole  coupling  between  neighboring 
CO-groups  in  the  lattice.  The  region  in  which  the 
vibrational  energy  is  localized  can  travel  as  a 
soliton  along  the  chain. 


In  his  original  theory  Davydov  [1]  used  an 
ansatz  for  the  wave  function  (iDj))  which  treats 
the  lattice  classically.  At  zero  temperature  is  has 
been  confirmed  that  Davydov  solitons  exist  for 
parameter  values  appropriate  for  proteins  [3]. 
Also  their  stability  against  disorder  along  the 
chain  was  studied  [4].  The  investigation  of  tem¬ 
perature  lead  to  controversial  results.  Halding 
and  Lomdahl  [5]  found  stable  pulses  at  T  = 
310  K  using  classical  molecular  dynamics  for  pep¬ 
tide  units  moving  in  a  Lennard- Jones  potential. 
Lomdahl  and  Kerr  [6a]  and  others  [7]  used  the 
ID2)  ansatz  together  with  a  damping  and  a  noise 
term  to  introduce  temperature  and  found  no 
stable  solitons  at  310  K  at  a  specific  set  of  param¬ 
eters.  Bolterauer  [8]  argued  that  their  classical 
thermalization  scheme  might  lead  to  a  too  large 
transfer  of  energy  into  the  quantum  system  (os¬ 
cillators).  Cottingham  and  Schweitzer  [9]  applied 
perturbation  theory  to  the  Hamiltonian  after 
partial  diagonalization  and  could  show  (again  for 
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one  set  of  parameters)  that  the  soliton  life-time 
at  300  K  is  too  short  for  biological  processes.  In 
our  previous  work  [4, 10, 11a]  we  prepared  the 
lattice  in  a  thermally  excited  state  prior  to  the 
soliton  start.  We  compared  our  results  to  those 
of  [6a]  and  found  agreement  between  the  models 
if  in  the  Langevin  model  [6a]  the  lattice  is  ther¬ 
mally  equilibrated  before  the  soliton  starts.  We 
could  show  that  in  a  window  in  the  parameter 
space  which  might  well  be  realistic  for  proteins 
travelling  solitons  exist  at  300  K. 

Recently  Brown  et  al.  [12]  have  shown  that 
the  ID2)  state  ansatz  does  not  reproduce  the 
dynamics  of  the  exactly  solvable  small  polaron 
limit  (dipole  -  dipole  coupling  neglected). 
Davydov  [2]  introduced  a  more  sophisticated 
ansatz  state  (|D,))  which  allows  for  quantum 
effects  in  the  lattice.  However,  he  used  the 
energy  expectation  value  for  |D,)  as  a  classical 
Hamiltonian  function  to  derive  equations  of  mo¬ 
tion  [2].  It  was  shown  that  with  these  equations 
|D,)  does  not  reproduce  the  small  polaron  limit 
[12]  either.  With  these  equations  of  motion  and  a 
thermally  averaged  Hamiltonian  Davydov  [2] 
could  show  within  the  continuum  limit  that  sol¬ 
itons  exist  at  300  K.  Cruzeiro  et  al.  [13]  reached 
at  the  same  conclusion  numerically  without  mak¬ 
ing  use  of  Davydov’s  approximations,  but  using 
also  the  thermally  averaged  Hamiltonian  as  a 
classical  Hamiltonian  function. 

Most  recently  Mechtly  and  Shaw  [14]  and 
Skrinjar  et  al.  [15]  could  derive  new  equations  of 
motions  for  |D,)  with  help  of  quantum  me¬ 
chanical  methods.  These  equations  of  motion 
reproduce  the  small  polaron  limit.  We  also  used 
the  Langrangian  method  described  in  [15]  to 
obtain  correct  equations  of  motion  for  the  |D,) 
ansatz  state  from  the  thermally  averaged  Hamil¬ 
tonian  derived  in  [2,13].  In  this  investigation 
[16-18],  as  well  as  in  our  previous  studies  within 
the  iDj)  state,  summarized  in  [17]  we  found  that 
Davydov  solitons  should  be  stable  at  300  K  if  the 
spring  constant  of  the  hydrogen  bonds  is  larger 
than  previously  assumed.  These  results  are  short¬ 
ly  reviewed  in  the  first  two  parts  of  section  2  of 


this  work.  There  are  doubts,  if  the  Davydov 
concept  of  using  a  thermally  averaged  Hamilto¬ 
nian  to  derive  equations  of  motion  from  it,  is  in 
agreement  with  statistical  mechanics.  There  is 
the  possibility  that  it  may  lead  to  results  which 
are  even  qualitatively  misleading.  Therefore  we 
present  in  the  third  part  of  section  2  a  com¬ 
parison  of  our  results  obtained  with  the  averaged 
Hamiltonian  method  with  the  exact  quantum 
Monte  Carlo  results  of  Wang  et  al.  [19]. 

Since  all  our  results  point  into  the  direction, 
that  the  hydrogen  bond  spring  constant  should 
be  large  to  allow  soliton  formation  at  300  K,  we 
go  back  to  the  suggestion  of  Scott  [3,20,21]  that 
in  one  chain  simulations  the  spring  constant 
should  be  larger  by  a  factor  of  three,  in  order  to 
simulate  the  three  coupled  chains  present  in  real 
protein  a -helices  within  a  one  chain  model.  For 
an  excellent  review  of  the  state  of  art  of  work  on 
Davydov  solitons  the  reader  should  consult 
Scott’s  recent  paper  [21].  In  the  first  part  of 
section  3  we  present  one  chain  simulations  with 
revised  parameters  according  to  Scott’s  sug¬ 
gestion.  In  the  second  part  of  section  3  we 
present  simulations  on  three  chains  using  differ¬ 
ent  excitations  (symmetric  as  well  as  asymmetric 
ones)  within  the  iDj)  ansatz  state  and  compare 
them  with  one  chain  results,  using  different  pa¬ 
rameter  values.  Finally  in  the  third  part  some 
examples  of  three  chain  simulations  including 
temperature  are  discussed. 


2.  One-chain  simulations  at  finite  temperature 

2.1.  The  1^2)  ansatz  state 

The  Hamiltonian  used  for  this  study  is  in  the 
most  simple  form  for  the  system  investigated  by 
Davydov  [1].  More  sophisticated  forms  of  the 
Hamiltonian  which  incorporate  more  details  of 
the  protein  structure  have  lead  qualitatively  to 
the  same  results  [3]. 
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+  ^Ko„iq„-  q„-i)  . 


(1) 


In  eq.  (1)  d*  (a„)  are  the  usual  boson  creation 
(annihilation)  operators  [3]  for  the  amide-I  oscil¬ 
lators  at  sites  n  (see  fig.  1). 

From  infrared  spectra  the  ground  state  energy 
of  an  isolated  amide-I  oscillator  can  be  deduced 
(£(,  =  0.205  eV).  Usually  for  all  parameters  in 
eq.  (1)  site-independent  mean  values  are  used. 
The  average  value  for  the  dipole  -  dipole  cou¬ 
pling  between  neighboring  amide-I  oscillators  is 
J  =  0.967  meV.  The  average  spring  constant  of 
the  hydrogen  bonds  is  taken  usually  to  be 
IV=13N/m.  p„  is  the  momentum  and  q„  the 
position  operator  of  unit  n.  The  average  mass  M 
is  taken  as  that  of  myosine  (A/  =  114nip;  = 

proton  mass).  The  energy  of  the  CO  stretching 
vibration  in  hydrogen  bonds  is  a  function  of  the 
length  r  of  the  hydrogen  bond  (£=£■„  +  Xr). 
For  X  the  experimental  estimates  are  35  pN  and 
62  pN. 

For  the  solution  of  the  time  dependent  Schro- 
dinger  equation  we  used  the  displaced  oscillator 
state  ansatz  (iDj))  of  Davydov  [1].  Davydov 
[1,2]  formed  the  expectation  value  of  the  Hamil¬ 
tonian  (1)  with  iDj)  and  used  this  expectation 
value  as  classical  Hamiltonian  function.  In  this 
way  he  obtained  the  equations  of  motion.  Ex¬ 
plicit  forms  of  the  equations  of  motion  used  can 
be  found  in  ref.  [11a].  The  iDj)  state  reproduces 
the  lattice  dynamics  for  7  =  0  correctly,  but  leads 
to  an  incorrect  phonon  energy  [12].  From  our 
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Fig.  1 .  Schematic  picture  of  a  hydrogen  bonded  channel  in  a 
protein. 


results  we  deduced  that  within  the  ID2)  model 
Davydov  solitons  should  be  stable  at  300  K  if  the 
spring  constant  W  of  the  hydrogen  bonds  is 
larger  than  roughly  30N/m  (see  [11a, 17]  for 
details). 

A  very  appealing  model  for  treating  tempera¬ 
ture  effects  on  Davydov  solitons  was  recently 
introduced  by  Cottingham  and  Schweitzer  [9]. 
They  [9]  could  diagonalize  the  Hamiltonian  par¬ 
tially.  Then  one  can  define  H  =  Hf^  +  W  where 
Ho  is  the  diagonal  part  of  H  and  W  the  non¬ 
diagonal  one.  Thus  W  can  be  treated  as  a  pertur¬ 
bation.  This  partitioning  is  reasonable,  and  for 
their  purpose  unique,  because  the  Davydov  sol- 
iton  state  (iD,))  is  an  exact  eigenstate  of  and 
thus  first  order  perturbation  theory  using  W  as 
perturbation  allows  the  calculation  of  soliton  life¬ 
times.  They  could  compute  the  transition  prob¬ 
ability  from  an  initial  state  containing  a  Davydov 
soliton  and  a  thermal  distribution  of  phonons  to 
a  final  state  without  the  soliton.  From  this  prob¬ 
ability  they  could  derive  an  explicit  expression 
for  the  life-time  of  a  pinned  soliton. 

To  compare  our  model  with  that  of  Cotting¬ 
ham  and  Schweitzer  [9]  we  took  a  cyclic  chain  as 
in  ref.  [9]  where  only  pinned  solitons  are  found 
numerically  [9].  We  used  W=13N/m.  X  = 
62  pN  and  a„(0)  =  A  sech[(a„  -  100)A'^/Wy]  as 
initial  condition  as  in  [9].  In  addition  we  im¬ 
plemented  the  symmetric  interaction  into  our 
program  for  the  purpose  of  comparison,  since  in 
ref.  [9]  this  model  was  used.  As  shown  in  [13] 
the  two  interaction  models  lead  to  rather  differ¬ 
ent  results.  For  comparison  we  varied  J  and 
estimated  soliton  life-times  from  our  model.  For 
this  purpose  the  initial  excitation  (ID2)  soliton 
solution)  was  centered  around  site  100  and  we 
used  the  soliton  detector  plot  of  Lomdahl  and 
Kerr  [6a].  We  found  [11a, 17]  that  the  two  mod¬ 
els  agree  very  well.  However,  one  should  keep 
two  restrictions  in  mind:  in  case  of  small  J  the 
continuum  limit  used  in  [9]  does  not  apply  and 
the  formula  for  the  life-time  given  in  [9]  still 
contains  approximations  which  were  removed  in 
a  later  paper  [23].  Thus  the  nearly  quantitative 
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agreement  [11a, 17]  between  the  two  models 
should  be  considered  as  accidental.  However,  we 
expect  that  also  with  the  improved  perturbation 
treatment  [23]  qualitative  agreement  should  still 
be  found.  With  the  new  formula  [23]  for  the 
parameters  life-times  between  0.30  ps  and  322  ps 
for  J  between  1  meV  and  0.1  meV  were  found 
[24],  This  still  agrees  qualitatively  with  the  range 
of  0.6-96  ps  from  our  model,  at  least  for  larger  7 
values.  For  the  smaller  ones  the  continuum  ap¬ 
proximation  used  in  [23]  breaks  down. 

However,  studies  on  Davydov  soliton  life¬ 
times  should  not  be  based  on  the  symmetric 
interaction  ansatz,  where  the  length  of  a  hydro¬ 
gen  bond  in  which  the  C=0  group  does  not  take 
part  is  considered  as  to  influence  the  C=0  group 
as  much  as  the  hydrogen  bond  in  which  it  partici¬ 
pates.  This  seems  to  be  rather  unrealistic.  To 
give  some  examples,  we  performed  similar  calcu¬ 
lations  as  reported  above  for  parameter  values 
where  we  found  stable  solitons  at  300  K  with 
asymmetric  interaction  and  open  (instead  of  cy¬ 
clic)  boundaries.  In  case  of  AT  =  1 10  pN  and 
W=  90  N/m  we  find  a  stable  soliton  in  our  simu¬ 
lations,  observed  in  the  computer  experiment  for 
more  than  100  ps.  However,  if  we  perform  the 
calculation  as  described  above  with  symmetric 
interaction  and  cyclic  boundaries  we  obtain  a 
life-time  of  only  3-4  ps.  This  again  is  quali¬ 
tatively  in  the  same  order  of  magnitude  as  per¬ 
turbation  theory  which  gives  0.8  ps  in  this  case 
[24]. 

In  general  it  seems  that  ID,)  dynamics  which 
approximates  the  lattice  classically  leads  to  sol¬ 
itons  which  are  more  stable  against  temperature 
than  in  theories  which  apply  the  exact  Hamilto¬ 
nian,  like  perturbation  theory  [9,23].  Also  Bol- 
terauer  [25]  who  studied  the  uncertainty  of  the 
soliton  energy  in  order  to  obtain  lifetimes  found 
that  both  quantum  and  temperature  fluctuations 
should  destroy  the  soliton  within  less  than  1  ps. 
However,  these  studies  have  in  common,  that 
the  decay  time  of  a  soliton  as  given  by  the  exact 
continuum  limit  solution  for  the  iDj)  ansatz 
state  is  investigated  within  the  exact  Hamilto¬ 


nian.  It  seems  to  be  more  realistic  to  investigate 
the  dynamics  of  an  initial  one-  or  two-site  excita¬ 
tion  than  that  of  a  preformed  soliton  obtained 
from  jDj)  continuum  theory.  As  we  found  in 
our  studies,  the  solitons  formed  from  such  an 
excitation  have  properties  which  can  differ  con¬ 
siderably  from  the  continuum  solutions.  To  study 
quantum  effects  on  solitons  one  has  to  compute 
the  dynamics  of  such  initial  excitations  with  bet¬ 
ter  ansatz  states,  instead  of  calculating  the  life¬ 
time  of  Id,)  solitons  within  the  exact  Hamilto¬ 
nian,  since  these  ID2)  solitons  are  only  solutions 
of  and  not  of  H.  The  full  Hamiltonian  H  can 
have  soliton  solutions  which  might  differ  drasti¬ 
cally  from  these  approximative  |D,)  solitons. 
Since  the  Schrodinger  equation  for  H  cannot  be 
solved  exactly,  besides  in  special  cases,  we  ap¬ 
plied  as  a  first  step  the  improved  |D,)  ansatz 
state  of  Davydov  [2]  which  allows  quantum  ef¬ 
fects  in  the  lattice  and  also  reproduces  the  exact 
solutions  in  the  small  polaron  limit  (7  =  0)  if  the 
equations  of  motion  are  correctly  derived  [14,15] 
but  is  still  an  approximation  for  7  #  0.  The  re¬ 
sults  of  these  investigations  are  reported  in  the 
next  section. 

2.2.  The  |D,)  ansatz  state 

In  this  chapter  we  want  to  report  on  our 
results  at  T  =  300K  using  the  |D,)  state  ansatz 
[17,18].  The  Hamiltonian  [1,2]  in  second  quan¬ 
tized  form  including  disorder  is  given  e.g.  in 
[17,18].  The  |D,)  ansatz  for  an  inclusion  of 
temperature  in  Davydov’s  approximation  for  so¬ 
lution  of  the  time  dependent  Schrodinger  equa¬ 
tion  is  [22] 

|D„^)  =  Sa:(0«:|0),|)8„,^').  (2) 

n 

Here  |0),,  is  the  exciton  vacuum,  and  a 
coherent  phonon  state.  For  the  one-quantum 
oscillator  states  used  here  S|a„|’  =  I  holds.  To 
include  temperature  approximately  we  assume, 
as  in  [13]  that  a  phonon  distribution  is  present  in 
the  lattice  where  each  normal  mode  is  occupied 
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by  quanta.  We  do  not  consider  a  thermal 
distribution  of  amide-I  quanta  since  at  300  K  the 
Boltzmann  factor  implies  that  only  3  of  10000 
amide-I  oscillators  would  thermally  excited. 
Thus  one  can  neglect  a  possible  thermalized 
soliton  distribution  in  the  system  too,  since  pres¬ 
ence  of  solitons  requires  first  of  all  amide-I  exci¬ 
tation.  Then  the  expectation  value  of  the  Hamil¬ 
tonian  in  this  state  is  formed  and  thermally 
averaged  over  all  phonon  distributions.  From 
this  equations  of  motion  are  derived  using  the 
Euler- Lagrange  formalism  [17,18]. 

This  method  introduced  by  Davydov  was  criti¬ 
cized  by  several  authors  as  being  inconsistent 
with  statistical  mechanics,  since  equations  of  mo¬ 
tion  are  obtained  from  a  thermally  averaged 
Hamiltonian.  However,  we  feel  that  it  might  still 
be  a  reliable  approximation  to  the  real  dynamics 
under  physiological  temperature.  To  investigate 
this  we  also  give  a  comparison  with  quantum 
Monte  Carlo  simulations  in  the  next  section. 
However,  since  in  previous  work  on  Davydov’s 
method  [2,13]  his  incorrect  derivation  of  equa¬ 
tions  of  motion  was  used,  it  seems  to  be  interest¬ 
ing  in  itself,  to  study  the  dynamics  of  the  system 
using  correct  equations  derived  from  the  aver¬ 
aged  Hamiltonian. 

The  equations  of  motion  have  been  solved 
using  a  fourth  order  Runge-Kutta  method  [19] 
and  the  dynamics  were  calculated  through 
=26  ps.  We  performed  a  survey  of  the  (A",  W) 
parameter  space.  At  7'  =  OK  solitons  occur  at 
much  larger  values  of  X  (>150  pN  [16])  and 
smaller  values  of  W  than  at  300  K  [17,18].  The 
reason  for  that  is,  that  effectively  the  dipole  - 
dipole  coupling  J  which  is  responsible  for  disper¬ 
sion  decreases  with  increasing  temperature,  in 
this  way  a  larger  temperature  can  stabilize  sol¬ 
itons  at  smaller  values  of  X.  This  is  consistent 
with  results  reported  by  Cruzeiro  et  al.  [13], 
although  the  effect  is  far  more  pronounced  in 
our  case  using  the  quantum  mechanically  correct 
equations  of  motion.  Thus  we  arrive  with  |D,) 
dynamics  at  the  same  conclusion  as  with  |D,) 
dynamics:  Davydov  solitons  should  be  stable  at 


3(X)  K  if  the  spring  constant  of  the  lattice  is  much 
larger  than  the  usually  applied  13N/m.  This 
leads  us  first  of  all  to  the  suggestion  of  Scott 
[3,21]  that  one  has  to  take  into  account  that  in  an 
a-helix  three  coupled  chains  are  present.  Scott 
argued  that  in  one  chain  dynamics  one  has  to  use 
a  larger  effective  value  for  W  and  M  in  order  to 
simulate  the  three  chain  case  with  a  one  chain 
model.  We  will  investigate  this  in  section  3  in 
more  detail.  But  before  turning  to  this  point,  we 
want  to  discuss  some  basic  problems  of  the  mod¬ 
els  applied.  One  may  ask  if  it  is  correct  to 
introduce  temperature  in  an  averaged  way  as 
done  here  and  if  not  fluctuations  taken  explicite- 
ly  into  account  could  destroy  the  solitons.  How¬ 
ever,  we  have  shown  in  our  previous  work 
[11,17]  that  in  case  of  the  |D,)  ansatz  models 
with  explicit  random  fluctuations  and  our  model 
using  a  lattice  prepared  to  7  =  300  K  in  a  de¬ 
terministic  way  lead  to  comparable  results.  Thus 
we  are  confident  that  in  the  |D, )  case  the  same 
conclusion  holds.  The  other  problem  is  the  ap¬ 
proximative  nature  of  the  |D,)  state.  To  im¬ 
prove  the  ansatz  in  this  respect  we  plan  to  take 
two-phonon  terms  into  account  in  the  generator 
of  the  unitary  transformation  ]14].  A  more  se¬ 
vere  problem  is  the  inconsistency  of  Davydov’s 
method  for  incorporation  of  temperature  with 
statistical  mechanics.  Thus  we  want  to  compare 
Id,)  results  with  exact  quantum  Monte  Carlo 
simulations  ii.  the  next  section,  to  be  able  to 
decide  if  Davydov’s  method  can  still  be  consid¬ 
ered  as  an  at  least  qualitatively  correct  approxi¬ 
mation. 

^  Comparisons  with  exact  quantum  Monte 
Carlo  results 

Wang  et  al.  [19]  reported  the  results  of  quan¬ 
tum  Monte  Carlo  simulations  on  the  Davydov 
Hamiltonian.  These  results  should  describe  the 
equilibrium  state  in  principle  exactly,  restricted 
only  by  numerical  inaccuracies  which  can  be 
controlled.  They  applied  the  parameters  usually 
used  in  the  literature  (7  =  0.967  meV. 
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W-  =13N/m,  A'  =  62pN,  Af=114mp),  cyclic 
boundaries,  the  symmetric  interaction,  and  rings 
of  24  sites.  They  determined  for  each  of  their 
configurations  the  excitation  site  n^,  and  rotated 
all  coordinates  such  that  this  site  is  in  the  middle 
of  their  lattice.  Then  they  computed  the  average 
{A„)  of  the  lattice  displacements  -  q„^^  — 
ti„-i-  Here  n  refers  to  the  rotated  coordinate 
system,  where  the  excitation  site  n,,  is  always  in 
the  center.  We  implemented  cyclic  boundaries 
and  symmetric  interaction  into  our  time  simula¬ 


tion  programs,  started  from  a  random  distribu¬ 
tion  of  one  amide-I  vibrational  quantum  and 
performed  a  time  average  of  the  same  quantity 
with  which  Wang  et  al.  performed  their  ensem- 
l  ;  average.  We  determined  n„  in  each  time  step 
as  the  site  where  the  excitation  probability 
kn(^)l*  >5  largest.  In  this  way  after  a  sufficient 
number  of  time  steps  (convergence  of  (A„))  we 
should  obtain  results  comparable  with  those 
from  Wang  et  al.  [19]. 

In  [19]  it  is  reported,  that  at  2.8  K  coherent 


Fig.  2.  Evolution  of  the  excitation  probability  |a„fr)l’  from  an  initially  randomly  distributed  amide-1  quantum  as  function  of  site  n 
and  time  t  in  cyclic  chains  of  25  units  and  with  symmetric  interaction,  using  the  ID,)  ansatz  state  and  Davydov's  model  for 
temperature  effects.  Each  subfigure  ( 1-6)  shows  every  5  000th  of  500000  time  steps  each  (t  =  2  fs).  (a)  T  =  2.8  K;  (b)  T  =  7.0  K- 
(c)  T=  11.2  K. 
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Fig.  2 

structures  are  obtained.  In  this  case  from  their 
fig.  1  we  estimate  a  maximum  of  at  «  =  «„ 
of  =0.07  A.  For  7  =  7.0  K  they  observe  that  this 
coherent  structure  starts  to  break  down,  leading 
at  1 1 .2  K  to  localized  structures  comparable  to 
small  polarons  (this  might  also  be  interpreted  as 
an  Anderson-like  localization  originating  from 
increased  disorder  due  to  thermal  fluctuations). 
For  7.0  K  the  maximum  of  (i4„)  is  found  to  be 
=0.08  A  and  =0.09  A  for  11.2  K.  The  latter 
value  is  already  close  to  the  infinite  temperature 
limit  of  =0.095  A  (19). 

In  fig.  2  we  show  our  result  obtained  with  the 


nt.). 

j£),)  state  and  Davydov’s  model  for  temperature 
effects,  however,  using  correct  equations  of  mo¬ 
tion  as  given  in  the  previous  section.  We  fol¬ 
lowed  the  dynamics  over  a  period  of  6  ns,  corre¬ 
sponding  to  3  000  000  time  steps.  The  calculation 
for  one  temperature  had  to  be  done  in  6  runs, 
where  one  requires  7.7  CPU  hours  computation 
time  on  a  Cyber  995E  computer  from  Control 
Data  Corporation  (500000  time  steps  each  run). 
Typically  (7=  11.2  K)  the  error  in  total  energy 
in  such  runs  is  =3  neV  and  the  norm  error  is  less 
than  0.05  ppm  (parts  per  million).  We  see  from 
fig.  2a  that  at  7=2.8K  after  roughly  Ins  a 
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Fig-  2  (com  ). 


soliton  nucleates  from  the  random  initial  condi¬ 
tions  and  performs  a  random  walk  in  the  system. 
This  corresponds  qualitatively  to  the  coherent 
structure  found  by  Wang  et  al.  [19]  at  this  tem¬ 
perature.  At  T^T.OK  (fig.  2b)  also  such  a 
localized  packet  forms,  but  it  is  first  of  all  smaller 
and  after  2  ns  it  remains  confined  within  a  few 
lattice  sites,  again  corresponding  to  the  destruc¬ 
tion  of  the  coherent  structure  towards  a  localized 
state  reported  in  [19]  for  this  temperature.  Final¬ 
ly  at  1 1.2  K  the  packet  remains  in  the  middle  of 
the  chain  and  becomes  smaller,  again  in  quali¬ 
tative  agreement  with  [19],  although  it  seems 
that  in  |D, )  theory  the  building  of  small  polaron- 


like  structures  is  shifted  to  higher  temperatures. 
Thus  Davydov’s  model  appears  to  be  in  quali¬ 
tative  agreement  with  quantum  Monte  Carlo  re¬ 
sults.  However,  it  does  not  agree  quantitatively 
as  fig.  3  shows  where  we  present  the  time  aver¬ 
age  {A^)  of  the  lattice  displacements  through 
3  000000  time  steps.  Obviously  the  structure 
found  has  a  decreasing  width  with  increasing 
temperature,  but  the  peak  values  are  =-0.026  A 
(2.8 K),  =-0.028 A  (7.0 K)  and  =-0.029 A 
(11.2  K)  in  contrast  to  the  much  larger  values 
found  in  [19]  (-0.07  A,  -0.08  \,  -0.09  A,  re¬ 
spectively).  Also  the  increase  of  this  peak  value 
with  temperature  is  much  less  pronounced  than 
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<*n>  (**)  <An>(8)  <A^>  (8) 


Fig.  3.  Time  average  {A„)  over  3000  000  time  steps  (6ns)  of  the  lattice  displacements  in  the  rotated  coordinate  system  for 
the  system  of  fig.  2  at  the  same  temperatures. 

in  the  quantum  Monte  Carlo  case.  Therefore  we  sing  ansatz  shows  a  narrow  and  too  high  peak  for 

conclude  that  Davydov’s  |D,)  model  gives  a  all  three  temperatures.  The  peak  does  not 

qualitatively  correct  picture,  but  fails  to  repro-  change  for  increasing  temperature  and  thus  for 

duce  exact  results  quantitatively.  this  ansatz  we  even  do  not  find  qualitative  agree- 

Let  us  turn  now  to  the  partial  dressing  state  of  ment  with  quantum  Monte  Carlo  results. 

Brown  and  Ivic  [26]  which  is  a  special  case  of  the  Finally  we  computed  again  the  average  (/4„) 

|D,)  ansatz  with  a  fixed  dependence  of  the  for  the  ID2)  ansatz  with  our  temperature  model, 
coherent  state  amplitudes  on  the  site  n.  Our  In  the  ID2)  case  the  peak  values  of  {A„)  are 
simulations  at  0  K  for  this  state  have  shown  that  again  smaller  than  the  corresponding  quantum 

the  results  are  rather  similar  to  |D,)  dynamics,  Monte  Carlo  values,  however,  with  the  wrong 

however,  with  solitons  appearing  at  even  higher  tendency:  they  decrease  somewhat  with  increas- 

values  of  X  than  in  zero-temperature  |Z?,)  ing  temperature  (2.8  K:  -0.0107  A;  7.0  K: 

theory  [16].  Our  implementation  of  the  partial  -0.0106  A;  11.2  K:  -0.0098  A).  The  decrease  is 

dressing  theory  including  temperature  is  de-  not  very  pronounced  and  it  seems  that  all  three 
scribed  in  [16]  and  we  do  not  repeat  it  here.  In  values  are  far  off  the  infinite  temperature  value 

case  of  this  state  already  after  900000  time  steps  given  in  [19].  Since  there  are  no  solitary  or  small 

of  2  fs  convergence  of  the  average  {A„  )  is  ob-  polaron  like  structures  visible  in  the  \a„\^  plots 

tained.  In  this  case  no  solitons  are  nucleated.  We  and  the  infinite  temperature  behavior  shows  al- 

found  that  here  the  infinite  temperature  value  ready  up  at  T  =  2.8  K  it  seems  that  here  the 

from  [19]  for  the  peak  is  underestimated  by  all  temperature  effects  are  overestimated  in  agree- 

three  curves.  Thus  even  at  2.8  K  where  in  [19]  ment  with  Bolterauer’s  [8]  argument.  In  conclu- 

coherent  structures  were  found,  the  partial  dres-  sion  it  seems  that  among  the  models  studied,  the 
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|D,)  ansatz  together  with  Davydov’s  treatment 
of  temperature  is  the  only  one  which  gives  an  at 
least  qualitatively  correct  picture,  although  it  is 
quantitatively  incorrect  and  furthermore  there 
are  doubts  on  the  validity  of  Davydov’s  ansatz 
for  the  description  of  temperature.  However,  it 
seems  that  Davydov’s  ansatz  can  be  viewed  as  a 
qualitatively  valid  approximation  (see  [27]  for 
details). 

3.  Three  chain  dynamics 

3. 1 .  One  chain  dynamics  at  T  =  300  K  with 
enlarged  mass 

In  this  paragraph  we  want  to  follow  Scott’s 
suggestion  to  use  revised  parameters  in  one 
chain  dynamics  in  order  to  simulate  three  chain 
dynamics  with  them.  Since  we  want  to  survey  the 
{X,  W)  parameter  space,  we  only  need  to  change 
the  site  mass  from  114mp  to  342mp.  However, 
we  have  to  keep  in  mind,  that  then  in  these 
simulations  W  is  an  effective  spring  constant  and 
no  longer  the  spring  constant  of  an  individual 
hydrogen  bond.  We  used  chains  of  50  units  and 
an  initial  excitation  at  one  site  (49)  as  in  the 
calculations  reported  before.  The  temperature 
was  300  K  using  Davydov’s  model  for  tempera¬ 
ture  effects  and  the  |D,)  ansatz  state.  The  time 
step  was  0.15  fs  and  we  followed  the  dynamics 
over  roughly  26  ps.  In  case  of  the  |D,)  state 
Scott’s  analytical  considerations  for  an  /I-mode 
excitation  within  the  jD,)  state  which  are  re¬ 
peated  shortly  in  the  next  section  do  not  hold. 
Thus  we  cannot  expect  that  also  for  the  |D,) 
ansatz  the  three  chain  case  is  reproduced  by  one 
chain  dynamics  if  we  use  3W  and  3M  instead  of 
W  and  M. 

The  results  of  our  calculations  are  displayed  in 
fig.  4  Since  the  enlarged  mass  influences  consid¬ 
erably  the  phonon  frequencies  we  have  to  expect 
that  interactions  between  the  soliton  and  sound 
waves  in  the  lattice  might  be  different  from  the 
one  chain  case  with  smaller  site  mass.  Indeed 


0  20  40  60  60  100 

X(pN) 

Fig.  4.  Survey  of  the  (A'.  W)  parameter  space  at  T  =  300  K 
using  Davydov's  temperature  model  and  the  |D,)  ansatz 
state  together  with  a  mass  of  3Af  =  342mp  (initial  excitation 
at  site  49,  time  step  O.lSfs,  chains  of  50  units,  crossed  open 
circles:  pinned  soliton,  horizontally  dashed  black  circles: 
pinning  or  reflection  after  roughly  40  sites,  vertically  dashed 
black  circle:  slowly  dispersive  solitary  wave,  horizontally 
crossed  black  circle:  soliton  which  becomes  pinned  after  a 
few  sites,  diagonally  crossed  black  circle:  moving  soliton 
which  becomes  pinned  after  interaction  with  the  shock  wave 
and  then  slowly  disperses,  black  circle:  travelling  soliton). 

this  is  the  case  as  the  figure  shows.  We  find  many 
parameter  values  where  the  solitons  become 
pinned  after  interaction  with  the  sound  wave 
reflected  from  the  chain  end.  After  that  inter¬ 
action  in  most  cases  the  solitons  disperse  slowly 
and  perform  a  random  walk  around  the  pinning 
site.  For  W=80N/m  and  A’  =  60pN  the  soliton 
is  even  reflected  from  the  shock  wave,  but  not 
destroyed.  Therefore  we  marked  this  point  in  the 
parameter  space  as  a  point  where  travelling  sol¬ 
itons  exist.  In  general  we  see  that  the  picture  is 
very  similar  to  that  for  the  reduced  mass  dis¬ 
cussed  in  section  2.2.  The  region  of  stable  sol¬ 
itons  is  again  shifted  to  small  X-  and  rather  large 
W-values,  and  the  boundary  of  this  region  ap¬ 
pears  to  be  roughly  linear.  However,  here  W  has 
another  meaning  than  in  section  2.2.  Thus  the 
threshold  for  soliton  formation  of  W«40N/m 
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should  correspond  to  a  spring  constant  for  single 
hydrogen  bonds  of  roughly  13N/m  which  is  the 
usually  cited  value,  while  the  also  used  value  of 
l9N/m  corresponds  here  to  W  =  60N/m.  How¬ 
ever,  the  factor  of  3  holds  only  for  |D,)  theory, 
while  for  |D,)  this  is  not  the  case.  Further  the 
figure  shows  that  for  a  value  A”  =  60  pN  W  has  to 
be  larger  than  70N/m  to  allow  travelling  sol¬ 
itons,  which  corresponds  to  a  hydrogen  bond 
spring  constant  of  =25  N/m. 

3.2.  Three  chain  simulations  at  T  =  0  K 

As  Scott  pointed  out  already  (see  [21  j  for  a 
recent  review)  a  long  time  ago  [31,  in  an  a-helix 
three  parallel  chains  of  hydrogen  bonds  exist, 
which  are  coupled  by  dipole-dipole  interactions. 
If  a  is  an  index  which  specifies  the  chains  (a  =  1, 
2,  or  3),  and  if  L  is  the  coupling  parameter 
between  two  neighboring  C=0  oscillators  on  dif¬ 
ferent  chains,  the  equations  of  motion  for  the 
simple  Id,)  ansatz  state  are  [3,21]: 

I.J 

+  -  (3) 

=  VV(  „  -  2q„„  +  ) 

(4) 

Scott’s  argument  is  now,  that  the  usually  used 
values  for  the  parameters  W  and  M  apply  only 
for  the  case  of  three  coupled  chains.  He  found 
[3]  for  the  interchain  coupling  parameter  a  value 
of  L  =  1.54  meV.  For  this  value  of  L  and  stan- 
dar.'  values  for  the  other  parameters  (W  =  13  N/ 
m,  Af  =  1 14mp,  A'  =  62  pN,  and  J  =  0.967  meV) 
we  performed  simulations  for  a  three  chain  sys¬ 
tem,  as  Scott  did  for  one  special  case  of  excita¬ 
tion  [3].  We  used  the  symmetric  A-mode  (a„,  = 
=  «„,),  the  linear  combination  of  the  two 
degenerate  £-modes  («„,=(),  a„2  =  l/V2, 

a„y  =  - 1  l\fl)  and  an  asymmetric  local  excitation 


(L)  of  one  unit  on  a  single  chain.  In  the  case  of 
the  A -mode  the  soliton  consists  of  three  identi¬ 
cal,  parallel  moving  localized  excitations  on  all 
three  chains,  while  in  the  £-mode  the  soliton 
moves  only  on  two  chains.  In  the  case  of  the 
local  excitation  the  soliton  is  found  mainly  on 
one  chain,  with  a  small  fraction  of  the  excitation 
transferred  to  the  others.  Scott  [3,21]  found  for 
the  A-mode  that  the  equations  of  motion  for 
three  chains  are  identical  to  the  equations  for 
one  chain  but  with  M'  =  3Af  and  W  -  3W.  The 
important  conclusion  is  that  numerically  this 
holds  not  only  for  the  symmetric  A-modes,  for 
which  Scott's  considerations  outlined  above 
hold,  but  also  for  the  £-mode  and  even  for  a 
single  chain  excitation  [28]. 

In  the  more  complicated  |D|)  case  the  equa¬ 
tions  of  motion  for  the  symmetric  A-mode  for 
three  chains  and  for  one  chain  are  identical  as 
shown  in  [28].  We  performed  three  chain  simula¬ 
tions  with  an  A-mode  excitation  and  again  the 
two  single  chain  simulations  (using  W,  M  and 
3Af,  3VF,  respectively),  but  now  computed  in  the 
|D, )  model  (A"  =  62  pN).  We  found  that  the  one 
chain  results  for  parameters  M  and  W  are  the 
same  as  those  from  the  three  chain  calculations 
as  expected  from  the  considerations  outlined  in 
[28].  However,  an  A-mode  excitation  does  not 
necessarily  occur  in  reality  and  thus  we  also  have 
to  study  initial  excitations  in  the  £-mode  and 
ones  which  are  localized  on  one  chain  where  the 
equations  for  one  chain  are  not  the  same  as 
those  for  three  interacting  chains.  This  is  dis¬ 
cussed  in  the  next  section. 

3.3.  Three  chain  dynamics  at  T  =  300  K 

Here  we  want  to  show  results  obtained  with 
the  equations  for  three  chains  with  general  initial 
excitations  in  the  |D,)  case  using  Davydov’s 
model  for  temperature  effects.  The  equations  are 
given  in  [28].  Let  us  first  discuss  a  localized 
excitation  at  one  of  the  terminal  sites  of  just  one 
spine.  We  have  chosen  a  chain  length  <'f  20  units 
for  each  chain  (excitation  at  site  19  of  chain  1) 
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and  the  usual  value  of  1.5373  meV  for  the  inter¬ 
chain  coupling  L  [21].  The  time  step  was  chosen 
as  0.25  fs.  In  this  case  in  a  typical  calculation  on 
localized  initial  excitations  the  total  energy  was 
conserved  within  roughly  2  p-eV  and  the  norm  to 
better  than  5  ppb  (parts  per  billion).  Repetition 
of  one  of  the  calculations  with  a  time  step  of 
0. 1  fs  lead  to  no  changes  in  the  results.  In  fig.  5 
we  show  the  results  of  these  simulations.  We  use 
for  W  the  values  13N/m  from  measurements  on 
formamide  crystals  and  19N/m  from  theoretical 
calculations.  For  X  we  use  35N/m  which  was 
found  by  Scott  and  also  the  usually  applied  value 


of  62  pN  (see  [21]  for  references).  Figures  5a  and 
b  show  that  for  W=  13  N/m  in  case  of  the  lower 
X  value  a  solitary  wave  is  formed  first  which 
becomes  trapped  roughly  in  the  middle  of  the 
chain,  while  for  the  larger  X  value  the  excitation 
is  trapped  close  to  the  chain  end.  In  both  cases  a 
considerable  part  of  the  excitation  goes  over  to 
the  initially  unexcited  chains.  In  case  of 
W=  19  N/m  (5c,  d)  at  the  lower  X  value  a  clear 
solitary  wave  is  formed  and  travels  through  the 
chain.  If  one  looks  closer  at  the  pulse  like  struc¬ 
ture,  one  sees  that  the  excitation  is  oscillating 
between  the  initially  excited  chain  and  the  other 


Fig.  5.  Time  evolution  of  |a„(<)|’  in  the  |D,)  ansatz  state  at  300  K  using  Davydov’s  model  for  temperature  effects  and  a  localized 
initial  excitation  (a,,  ,(0)  =  1,  all  other  a’s  equal  0.005,  then  normalized  to  1,  all  three  chains  are  shown:  chain  1:  n  =  1-20,  chain 
2:  n  =  21-40,  chain  3:  n  =41-60,  time  step:  0.25  fs)  for  different  values  of  W  and  X: 

(a)W=13N/m,  3r  =  35pN;  (b)W=13N/m,  ,V  =  62pN;  (c)W'=19N/m,  A'  =  35pN;  (d)W=19N/m,  A'  =  62pN; 
(e)  1^=40 N/m,  =  62 pN;  (f)  W=  60 N/m,  A'  =  62pN. 
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two.  When  the  excitation  probability  is  small  on 
chain  1,  it  is  large  on  chains  2  and  3  and  vice 
versa.  At  the  larger  X  value  the  excitation  is 
again  trapped  close  to  the  initial  excitation  site. 
For  A'  =  62pN  we  have  increased  W  to  30N/m, 
40N/m,  50N/m  and  60N/m.  Two  cases  are 
shown  in  fig.  5e  (40N/m)  and  f  (60N/m).  As  in 
case  of  one  chain  simulations  an  increase  in  W 
favors  soliton  formation  and  we  found  soliton 
formation  between  W=A0  and  60N/m,  even 
quantitatively  in  fair  agreement  with  one-chain 
and  even  with  |D,)  results.  The  results  for  the 
ii-mode  are  more  or  less  similar  [28].  Again 
from  40  N/m  solitary  wave  formation  can  be 
expected.  In  case  of  A-mode  excitations  a  strong 
tendency  to  trapping  after  a  few  sites  shows  up. 
However,  for  W=  19N/m  and  A'  =  35pN  again 
a  solitary  wave  shows  up.  In  the  cases  of  larger 
W  values  we  observe  again  solitary  wave  forma¬ 
tion,  but  at  40N/m  after  the  reflection  the  sol¬ 
iton  oscillates  around  the  chain  end.  Increasing 
W  to  60N/m  leads  again  to  a  solitary  wave,  but 
it  moves  erraticly  after  reflection  [28].  Thus  we 
reach  at  the  same  basic  conclusion  as  in  case  of 
most  other  models  (with  the  exception  of  the 
partial  dressing  state)  we  studied:  If  W  is  larger 
then  30-40  N/m  solitary  wave  formation  should 
be  possible  in  proteins  also  at  physiological  tem¬ 
perature. 

4.  Conclusion 

With  the  classical  ansatz  state  we  per¬ 
formed  dynamic  simulations  on  one  chain  using 
different  models  for  the  incorporation  of  a  finite 
temperature  into  the  Davydov  soliton  theory. 
We  varied  the  parameters  W  (hydrogen  bond 
spring  constant)  and  X  (oscillator-lattice  cou¬ 
pling  constant).  We  found  that  at  300  K  the 
spring  constant  of  the  hydrogen  bonds  should  be 
larger  than  roughly  30-40 N/m  to  allow  forma¬ 
tion  and  propagation  of  Davydov  solitons  in  the 
system.  From  calculations  with  the  |D,)  ansatz 
state  which  allows  for  quantum  effects  in  the 


lattice  and  Davydov’s  method  for  incorporation 
of  temperature,  we  arrived  at  roughly  the  same 
conclusion.  Since  Davydov’s  method  to  account 
for  temperature  effects  is  believed  by  several 
workers  to  be  inconsistent  with  statistical  mech¬ 
anics,  we  performed  comparisons  with  exact 
quantum  Monte  Carlo  results  (QMC)  [19].  Since 
these  results  are  available  only  ii)r  one  set  of 
parameters  and  with  the  rather  unrealistic  sym¬ 
metric  interaction  ansatz,  we  incor|X)rated  these 
features  into  our  program  to  be  able  to  perform 
comparable  calculations.  Instead  of  the  ensemble 
average  in  QMC  we  performed  a  time  average 
over  the  lattice  displacements  in  a  rotated  coor¬ 
dinate  system.  We  found  that  the  transition  from 
coherent  to  localized  structures  between  2.8  K 
and  11.2  K  reported  in  [19]  is  reproduced  by 
Davydov’s  method.  However,  quantitatively  the 
averaged  lattice  displacements  and  their  vari¬ 
ation  with  temperature  are  underestimated.  Thus 
the  method  serves  as  a  qualitatively  correct  ap¬ 
proximation  to  the  dynamics  at  physiological 
temperature.  The  classical  iDj)  ansatz  seems  to 
overestimate  temperature  effects,  while  the  par¬ 
tial  dressing  ansatz  [26]  leads  neither  to  coherent 
structures  below  7.0  K  nor  to  localized  ones 
above  7.0  K  and  also  underestimates  the  aver¬ 
aged  lattice  displacements  and  their  temperature 
variation. 

For  the  |  D2 )  ansatz  state  we  performed  soliton 
dynamics  in  a  system  of  three  coupled  chains  as 
suggested  and  done  earlier  by  Scott  et  al.  [3].  We 
could  verify  Scott’s  proposal  that  three  chain 
dynamics  can  be  simulated  with  one  chain  calcu¬ 
lations  if  one  takes  as  site  mass  three  times  the 
value  applied  in  tne  three  chain  dynamics  and  for 
the  spring  constant  of  the  hydrogen  bonds  also 
three  times  the  value  for  one  bond.  We  obtained 
this  result  independent  of  the  initial  excitation, 
which  can  correspond  to  the  symmetric  A  mode 
or  to  the  degenerate  E  mode  of  the  three  chain 
system.  Even  for  a  localized  excitation  on  one 
chain  only  we  obtained  agreement  between  three 
chain  dynamics  and  those  using  one  chain  but 
changed  parameters. 
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For  the  |D,)  state  we  surveyed  again  the 
(A’,  y)  parameter  space  at  300  K  using  one  chain 
and  3M  as  site  mass  for  stable  solitons  within 
Davydov’s  method.  We  found  that  W  has  to  be 
larger  than  roughly  70N/m  to  allow  soiiton  for¬ 
mation  around  A  =  60  pN.  Since  this  value  would 
correspond  to  a  spring  constant  of  about  25  N  / 
m,  we  arrive  more  or  less  at  the  same  conclusion 
as  discussed  above  for  one  chain,  if  the  sug¬ 
gestion,  that  three  chain  dynamics  could  be  re¬ 
produced  by  one  chain  simulations  if  a  site  mass 
of  3 A/  is  used,  is  correct  also  in  the  |D,)  case. 
However,  at  OK  we  found  that  in  the  |D,)  case 
three  chain  dynamics  with  parameters  M,  W  and 
A  are  identical  to  one  chain  simulations  with 
the  same  parameters  in  contrast  to  jDj)  dy¬ 
namics. 

With  explicit  three  chain  simulations  at  7  = 
300  K  using  Davydov’s  temperature  model  we 
found  again  that  solitary  waves  are  formed  at  W 
values  larger  than  30-40  N/m.  However,  if  A  is 
around  35  pN  instead  of  62  pN  already  from  W 
around  19  N/m  solitary  waves  were  observed. 
Thus  the  conclusion  drawn  earlier  by  us  remains 
unchanged:  If  the  spring  constant  of  the  hydro¬ 
gen  bonds  in  protein  a -helices  is  larger  than 
30-40  N/m  the  Davydov  soiiton  should  be  able 
to  function  at  300  K.  Interestingly  this  conclusion 
is  reached  with  both  of  Davydov’s  ansatz  states 
and  with  different  models  for  temperature  ef¬ 
fects.  Since  the  usually  quoted  value  of  13  N/m 
derives  from  formamide  crystals  where  the  hy¬ 
drogen  bonded  molecules  vibrate  freely  it  should 
be  too  small  for  proteins.  In  proteins  the  hydro¬ 
gen  bonded  sites  are  embedded  in  the  covalent 
backbone  of  the  helix  which  becomes  distorted 
due  to  the  vibration.  Thus  we  expect  the  spring 
constant  of  a  protein  normal  mode  correspond¬ 
ing  to  hydrogen  bond  stretch  to  be  much  larger 
than  that  of  crystalline  formamide,  and  thus 
probably  allowing  for  Davydov  solitons  to  be 
formed  in  proteins.  However,  detailed  calcula¬ 
tions  or  measurements  on  the  spring  constant  in 
proteins  are  necessary  to  decide  finally  on  the 
question  of  existence  of  Davydov  solitons. 
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Our  generalised  Bethe  ansatz  method  is  used  to  formulate  the  statistical  mechanics  of  the  classical  Toda  lattice  in  terms 
of  a  set  of  coupled  integral  equations  expressed  in  terms  of  appropriate  action-angle  variables.  The  phase  space  as 
coordinatised  by  these  action-angle  variables  is  constrained;  and  both  the  soliton  number  density  and  the  soliton 
contribution  to  the  free  energy  density  can  be  shown  to  decouple  from  the  phonon  degrees  of  freedom  and  to  depend  only 
on  soliton-soliton  interactions.  This  makes  it  possible  to  evaluate  the  temp)erature  dependence  of  the  soliton  number 
density  which,  to  leading  order,  is  found  to  be  proportional  to  T'’'. 


1.  Introduction 

This  paper  is  concerned  with  Alwyn  Scott’s  contributions  to  molecular  biology  -  particularly  with  his 
recent  work  [1,2]  on  the  thermal  excitations  of  the  Toda  lattice  [3,4]  viewed  as  a  model  of  the  DNA 
molecule  at  biological  (i.e.  body)  temperatures  =»310K.  It  connects  in  a  natural  way  with  his  paper  to 
this  meeting  [5]  ‘There’s  more  than  one  way  to  skin  Schrodinger’s  cat’  because  even  classical  statistical 
mechanics  is  intimately  related  to  quantum  mechanics  (recall  [6]  the  Gibbs’  paradox  e.g.).  Thus  just  as 
one  can  rewrite  the  title  of  Alwyn  Scott’s  (ACS’s)  paper  [5]  in  another  language,  Danish,  as  ‘Man  kan 
fla  Schrodinger’s  kat  pa  mere  end  en  facon’  (the  title  actually  reported  during  the  meeting  in 
Copenhagen’s  newspaper  ‘Berlinske  Tidende’)  and  one  can  also  rewrite  this  in  many  different  ways,  so 
in  quantum  mechanics  there  are  many  technically  different  lines  of  argumentation  using  the  many 
different  representations. 

Thus  for  the  many  quantum  integrable  lattice  models  [7]  which  have  a  commuting  number  operator 
N,  [H,  A]  =  0  and  H  is  the  Hamiltonian,  there  is  the  standard  number  state  method  (SNSM)  [5,7]  and 
there  is  the  quantum  inverse  scattering  method  (QISM)  [5,7]  equivalent  to  the  Bethe  ansatz  (BA) 
method  as  conceived  by  Bethe  [8].  The  QISM  as  well  as  the  BA  can  also  be  developed  for  field 
theories,  while  there  is  the  Feynman  propagator  method  for  both  integrable  and  non-integrable  field 
theories  and  lattices  in  which  T)  (t  is  time  and  0^fs7')isa  Green’s  function  expressible  in 

terms  of  the  classical  action  S{<f>,  <I>q,  t)  as  a  path  (or  functional)  integral  on  a  Hilbert  space  [9]; 

<I>q;  T)  can  be  expressed  (with  advantage  [10])  as  a  functional  integral  on  a  symplectic  manifold  M 
(a  phase  space)  infinite  dimensional  in  the  case  of  field  theories  [10].  Wick  rotation  on  the  time,  so  that 
f— »  -if,  -i/3  yields  the  partition  function  Z(/3)  =  Tr  <f>a,  -ijS)  =  J  3)<f>o  K((f)o,  tfr^,  -i/3)  where 
2i  means  functional  integration  [10]  and  with  T  now  the  temperature.  The  free  energy 

F=-/3“'lnZ,  and  =  k^T^O  gives  the  ground  state  energy  in  thermodynamic  limit  [7,11]. 
Alternatively  and  equivalently  F=  E  —  p~'S  where  E  is  an  energy  defined  for  the  model  at 
temperature  /3  ~ '  and  5  is  an  entropy.  F  is  then  minimised  under  the  constraint  of  the  algebraic  Bethe 
ansatz  described  by  ACS  [5].  Yang  and  Yang  [12]  introduced  this  particular  method  for  the  quantum 
SM  in  the  case  of  the  integrable  quantum  repulsive  nonlinear  Schrodinger  model  (the  ‘bose  gas’).  Both 
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the  ‘bose  gas’  and  the  discrete  lattice  hose  gas  can  be  handled  this  way  [7],  With  (/f, /V]  =  0, 
£  -  /3  ~  '5  -  flN  =  -pL,  where  L  is  the  ‘volume’  in  one  dimension,  pL  is  the  chemical  potential,  and  p  is 
the  pressure.  All  of  these  methods  for  the  quantum  integrable  systems  rest  on  algebraic  methods  and 
most  especially  on  the  quantum  groups,  non-commutative  and  non-cocommutative  for  the  quantum 
integrable  models,  commutative  and  cocommutative  for  the  classical  integrable  models  [7,13,14].  Fig.  1 
in  ref.  [14]  shows  some  of  these  algebraic  connections:  the  classical  inverse  method  (top  left)  and  the 
QISM  (third  column  down  from  the  right)  are  shown,  while  our  work,  e.g.,  refs.  [15-22],  on  the 
quantum  and  classical  SM  of  the  integrable  models  is  summarised  by  the  route  from  the  Lax  pair  of  the 
inverse  method  to  periodic  boundary  conditions  (b.c.s.)  across  the  top  and  then  down  to  the  partition 
function  Z  given  by  eq.  (4)  below  and  out  to  EXPERIMENTS. 

The  functional  integral  form  of  quantum  Z  we  have  used  is  defined  on  the  symplectic  manifold  M  as 

Z  =  Tt  j  3)11  3l<f>  exp  S[n,  ((>] ,  {fl,  <f>}  =  S{x  -  x') .  (1) 

The  classical  action  S  (for  quantum  Z)  is  [10] 

ph 

S[/7,</)]  =  /i-'  I  dT(i  I dx  - //[/7,  </.])  ,  (2) 

0 

where  i  =  and  //[/7,  0]  is  the  classical  Hamiltonian.  The  classical  limit  of  this  quantum  Z  is  the 
classical  Z  given  by  (1)  with 

S[n,<t>\=-fiH[n,4>].  (3) 

This  limit  must  be  taken  so  that  ^—>0  with  zero  winding  number  [10]. 

Natural  coordinates  for  the  integrable  models  are  the  action-angle  variables  (since  5  is  classical  for 
both  quantum  and  classical  Z  we  can  use  the  action-angle  variables  as  the  natural  coordinates  on  M  in 
both  cases).  Under  this  choice,  classical  Z,  eq.  (1)  with  (3)  becomes 

Z  =  Tr  J  Si/a  exp -/3//[p] ,  (4) 

where  H[p\  depends  only  on  the  action  variables  and  3fx  is  a  measure  for  functional  integration  on  M 
to  be  determined  [10,15-22].  These  action-angle  variable  coordinates  enable  us  to  define  a  soliton 
number  and  its  mean  value  (A,)  (note  that  in  terms  of  canonical  coordinates  77,  <t>  there  is  no  means 
of  identifying  soliton  or  other  particular  contributions  (the  phonons -see  below)  to  Z). 

The  calculation  of  Z  simplifies  to  some  extent  for  lattices  (i.e.  for  integrable  lattices,  quantum  or 
classical)  where  the  functional  integral  for  classical  Z  eq.  (1)  with  the  action  (3),  becomes  a  product  of 
ordinary  integrals.  Generically  the  action-angle  variables,  under  vanishing  b.c.s  at  ±ac,  split  into 
variables  Pik),  Q{k)  labelled  by  modes  k,  together  with  ‘soliton  coordinates’  p^,  <7,  (/  =  1,  2, . .  .  ,  AJ. 
Our  method  of  calculation  exploits  the  fact  that  in  thermodynamic  limit  there  is  a  connection  between 
action-angle  variables  under  these  b.c.s.  and  action-angle  variables  under  periodic  b.c.s.  Under 
periodic  b.c.s.  the  integrable  lattice  has  a  total  number  N  of  degrees  of  freedom  (the  total  number  of 
lattice  points  in  the  period).  The  thermodynamic  limit  is  then  taken  so  that  e.g.  lim^^^,,  is  a 

non-zero  soliton  number  density. 

In  the  action-angle  variable  description  the  number  of  modes  Abodes  of  modes  k,  and  the 
number  of  solitons  satisfy  ^  for  fixed  N.  The  limit  A— » ac  is  then  taken  at  finite  density 

as  described.  For  fixed  N  one  finds  for  the  classical  integrable  models  that  generically 
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H[p]^ 


^tnoUcs 

S  caik,)P{k,)  +  lH[p,]. 

1=1  y=l 


(5) 


The  mode  sum  is  apparently  the  linearised  part  and  ot(k^)  is  the  linearised  dis[>ersion  relation.  However, 
the  mode  sum  also  provides  nonlinear  contributions  to  classical  Z,  eq.  (4)  -  as  is  exemplified  in  the 
calculations  [16]  for  the  sinh-Gordon  model.  This  is  because  the  k^  couple  to  the  P,  in  our  classical 
generalization  of  the  Bethe  equations  as  we  explain  for  the  Toda  lattice  below.  We  call  this  contribution 
to  (5)  the  ‘phonon  contribution’  in  this  paper.  The  remaining  part  of  (5)  provides  the  soliton 
contribution  to  Z. 

In  these  terms  the  classical  lattice  partition  function  Z  and  mean  soliton  number  {N^)  are  given  by 


/  ^  P  ^modes  . 

(N,)=Z-'(E^NsW)“'J  _n  dP(/c,)ndp,exp-/3//[p]), 

/V  ^  ^modes 

Z=  E  ‘  n  dP(/:,)n  dp  exp-^//[p], 

V,=  l  •^1  =  1  /  =  1 


(6) 


with  =  N.  The  (NJ)  ‘  is  due  to  Willard  Gibbs  [6].  In  thermodynamic  limit  N— ►  *  so  that 

N^N  >0.  We  can  execute  this  limit  best  by  going  to  a  ‘number  representation’  in  terms  of  densities 
Ps(P)  for  the  solitons  (J  pj^p)dp  =  \imN^N  ')  and  defining  a  ‘phonon’  density  p{k).  Total  particle 
density  is  then  /  p(k)Ak  +  /  pXp)^p,  and  we  can  then  work  with  variable  numbers  of  particles  by 
introducing  chemical  potentials  and  (say).  The  problem  posed  and  largely  solved  in  this  paper  is 
to  carry  out  this  calculation  for  the  classical  Toda  lattice  and  most  particularly  to  find  the  mean  soliton 
number  density  {N^)N~'. 

This  problem  is  of  particular  interest  for  at  least  two  reasons.  Most  importantly  for  this  meeting,  in 
refs.  [1,2]  V.  Muto,  A.C.  Scott  and  P.L.  Christiansen  calculate  this  density  numerically  for  finite  but 
quite  large  values  of  N  and  they  find  unequivocally  that  where  T  is  the  temperature.  A 

second  reason  is  that  we  had  believed  that  our  methods  for  Z  just  sketched  could  be  applied  to  all  of 
the  integrable  models  in  1  +  1  dimensions,  lattices  and  field  theories  both  quantum  and  classical.  Our 
methods  have  been  the  functional  integral  methods  sketched,  or  our  method  of  ‘generalised  Bethe 
ansatz’  [15,16,19]  which  apparently  extends  the  Yang  and  Yang  BA  method  [12]  to  the  quantum  and 
classical  SM  of  all  of  the  integrable  models  in  1  +  1  and  finds  in  many  cases  alternative  descriptions  in 
terms  of  fermi,  bose,  or  other  statistics  [10]  for  each  model.  We  reported  solutions  for  Z  and  F  for  some 
14  integrable  models  in  ref.  [19]  and  it  was  therefore  surprising  to  discover  [20]  that  the  classical  Toda 
lattice  did  not  seem  to  fit  within  these  schemes.  Particularly  seemed  impossible  given 

the  expressions  for  Z  we  were  able  to  find  [20].  There  has  been  a  considerable  variation  in  results 
already  obtained  for  the  Toda  lattice:  ref.  [2]  quotes  an  unpublished  found  by  Schneider  and  Stoll 
and  an  error  in  this  calculation  found  by  Bolterauer  and  Opper  [23],  as  well  as  {N^)N~'  =  (In  2l'n^)f,  f 
a  normalised  temperature  by  Mertens  and  Buttner  [24,25].  The  paper  most  relevant  to  ours  is  ref.  [26] 
which  finds  This  paper  used  the  classical  action-angle  variables  but  uses  these  inconsistently.  We 
refer  in  more  detail  to  this  paper  ref.  [26]  elsewhere. 


2.  Toda  lattice 

In  our  paper  we  sketch  what  appears  to  be  a  correct  ab  initio  non-phenomenological  analytic  theory 
for  the  quantity  lim^^,  (^s)^  '  and  show  that  the  behaviour  found  numerically  [1,2]  can  still  be 
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accommodated  in  our  expression  for  the  free  energy  density,  lim  FA/  '  which  also  shows  no  non- 
analytical  behaviour  in  T.  Incidentally  it  is  a  particular  pleasure  to  work  once  again  on  the  Toda  lattice 
because  Professor  Morikazu  Toda  was  present  at  this  meeting  and  himself  gave  a  paper  on  it  [4J. 

In  refs.  [1,2]  two  numerical  method  were  used:  one  of  these  was  to  create  Gaussian  chaotic  Brownian 
motion  at  a  chosen  temperature  on  a  lattice  of  N  points  under  periodic  b.c.s.  T  was  identified  by  setting 
the  mean  kinetic  energy  per  particle  =  jk^T.  In  fact  remarkably  similar  results  wc.c  found  [2]  for  a 
number  of  different  initial  distributions  of  energy  between  the  kinetic  and  potential  energy  parts  of  the 
total  energy.  For  canonical  the  general  results  of  classical  SM  for  both  linear  and  nonlinear 

systems  are,  of  course,  the  ‘virials’  {P„aHldPj  =  (P„Q„)  =  ^  ';  {Q„dHldQ„)  =  -{P„Q„)  =^"'. 
The  number  of  solitons  was  defined  and  counted  by  an  ingenious  ‘spectral  analyser’  method  which 
counts  the  number  of  eigenvalues  A  of  an  associated  spectral  problem  lying  inA>+l  orA<-l.  Our 
definition  of  soliton  number  is  through  the  action-angle  variables  and  is  therefore  essentially  identical 
to  this  definition  of  refs.  [1,2].  Thus  we  should  be  in  complete  agreement.  (Note  that  th;  definition  of  a 
soliton  in  equilibrium  or  non-equilibrium  SM  is  not  necessarily  unique  -  since  (a)  periodic  b.c.s.  are 
used  and  (b)  even  under  vanishing  b.c.s  at  the  soliton  in  the  equilibrium  SM  (thermal  equilibrium)  is 
not  the  ‘bare’  soliton  calculated  through  the  classical  inverse  method).  The  numerical  results  found  by 
this  first  method  were  unequivocally  (A/jA/  '  «  7’''^[1,2,27].  The  second  method  was  [2,27]  to  drive  the 
Toda  chain  by  a  random  force  tj„(/)  at  each  lattice  site  n.  Again  even  for  N  =  32  (cf.  fig. 

4  of  ref.  [2]).  The  spectral  analyser  was  again  used  to  count  the  solitons. 

It  is  possible  to  calculate  the  partition  function  Z  for  the  Toda  lattice  exactly.  Toda  gave  Z  for  one 
fixed  end  of  the  chain,  i.e.,  at  constant  pressure,  in  ref.  [3].  We  gave  a  closed  form  for  Z  at  constant 
volume  (two  fixed  ends)  in  ref.  [20]  as  Z  =  Zj^^^Zp^,:  in  which  a,  b  are  the  usual 

Toda  parameters  and  Zp^,  was 


Z  =  e'^^ 

^poi  ^ 


/•»  *  * 


There  are  now  2N  +  1  points  in  a  period  under  periodic  b.c.s  and  (ft  =  1) 


Z  =  (27-)-=''f  n  dP.dO.e-"''''--  ‘'-I, 

/I  =  -  Ai’  +  I 


where 


H[P„,Q„]  =  ab-'  S  (8) 

n  =  -  N 

is  a  Hamiltonian  for  a  scaled  Toda  lattice:  {F„,  Q^}  =  3„,„.  The  are  dimensionless  and  ba~^P„  is 
dimensionless;  ab~'  is  an  energy  and  /  is  scaled  as  y/abM ‘  f— »/.  The  closed  form  (7)  is  exact  and  has  a 
low  temperature  asymptotic  expansion  in  the  dimensionless  temperature  of  the  form  [20] 

Hm^  F(2N  ln[l  +  C{ba  'p  ')] 

-t-  0’(ln(2A/)/(2A/  -I-  1))  -l-  6(\/(2N  +  1)) ,  (9) 

in  which  the  [1  +  6{ba~' p~')]  is  an  asymptotic  power  series.  The  /3‘'in/3  is  the  linear  part  of  the 
phonon  contribution  and  the  /S  '  ln[-  ■  •]  is  the  nonlinear  contribution  provided  by  both  solitons  and 
phonons.  The  error  €X\/(2N  +  1))  is  usual  for  the  thermodynamic  limit  2N  +  1— »3c.  The  0(ln(2A)/ 
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{IN  +  1))  is  not  usual  and  conceals  a  divergent  (~lnN)  phonon  number  density.  However  the  free 
energy  (9)  is  well  defined. 

It  is  worthwhile  comparing  (9)  with  a  corresponding  low  T  expansion  for  the  integrable  classical 
sine-Gordon  (sG)  model  [15,17,19j.  One  can  calculate  the  sG  soliton  density  to  be  A«[8/'irr]' '  e  ‘  '  + 
corrections  f)Xr);  t  =  Mp~\  m  is  the  sG  phonon  mass  and  M  =  Smy„'  is  the  soliton  mass  (-yo  is  the 
coupling  constant).  This  is  -/3  times  the  leading  term  in  the  soliton  contribution  to  the  free  energy 
density.  On  this  line  of  reasoning  we  see  that,  since  ln(l  +  C{ba~^^)]  in  (9)  is  itself  a  simple-power 
series  in  ba~'p,  a  behaviour  for  /{2N  +  1)  nonanalytic  in  ^  '  and  proportional  to  j8  ~'  ^ 

cannot  apparently  be  accommodated.  Despite  this  situation  the  main  result  reported  in  this  paper  is, 
nevertheless,  that  'to  first  order’, 

1  /  7  \  4a 

Jim<N,>/(2Af+l)  =  -r(5)(j.)  ,  (10) 

The  result  (10)  has  also  been  given  by  F.  Marchesoni  and  G.  Lucheroni  [28J.  Unfortunately  we  are 
unable  (so  far)  to  justify  any  part  of  the  argument  these  authors  give  to  reach  this  result  though  ref.  [26] 
may  be  suggestive.  The  analysis  of  ref.  [28]  is  based  on  phonons  and  is  modelled  in  effect  on  the  sG 
analysis  [15].  We  show  here  next  that  the  thermodynamics  of  the  Toda  lattice  is  very  different  from  that 
of  sG  and  that  phonons  play  no  role  in  the  calculation  leading  to  (10).  Fig.  1  in  ref.  [28]  shows  how  well 
formula  (10)  corresponds  to  the  data  of  fig.  4  in  ref.  [2]  (for  which  2N  +  1  =32).  The  line  through  the 
actual  data  in  this  figure  1  of  ref.  [28]  was  reached  by  considering  additional  corrections  due  to 
soliton-soliton  interactions  otherwise  totally  neglected.  Note  that  our  result  (10)  is  in  thermodynamic 
limit  where  2A^  +  l^^c;  in  ref.  [2]  it  seems  that  the  factor  •7T‘'r(^)(r„)''  ’  changes  so  as  to  move  a  line 
through  the  data  on  the  \n{{N^) /{2N  +  1)  against  In  T  plot  to  the  left  as  2iV  +  1  increases  from  =32  to 
=  128  whereas  ( 10)  already  differs  from  the  data  in  fig.  4  of  ref.  [2]  by  a  missing  factor  of  =1.5,  pushing 
the  line  to  the  left.  On  the  other  hand  the  first  of  the  two  methods  which  has  2N  +  1  =64  000  is  much 
closer  to  (10).  For  the  exact  comparison  between  the  numerical  results  and  our  analytic  methods  more 
numerical  work  like  that  of  ref.  [2]  is  now  needed. 

In  ref.  [20]  we  give  more  detail  of  the  calculation  of  lim^,.,  {Nj/{2N  -l-  1)  for  sG:  we  gave  explicit 
integral  equations  whose  iteration  yields 

p,(/7)~(27r)''e-'^^‘‘'’',  (11) 

where  Efp)  is  the  excitation  energy  of  solitons  at  the  temperature  p  '  =  k^T  and  is  found  from  the 
classical  Yang  and  Yang  type  integral  equations  given  in  refs.  [15,17,19].  From  (11)  we  can  see  that  the 
soliton  contribution  to  the  free  energy  density  of  sG  is  -2/3  '  JT,  Ps{p)  dp  =  -2/3  \n{Nj/{2N  -i-  1)  = 

m[8/TTr]’  ■  e’’ '  -I-  G{t)  as  quoted  before.  The  same  connection  between  soliton  number  density  and  free 
energy  contribution  does  obtain  for  the  Toda  lattice  -  providing  we  push  all  difficulty  in  calculating  the 
resultant  total  free  energy  density  into  the  remaining  phonon  contributions.  We  noted  already  that  there 
is  no  definable  phonon  number  density  for  the  unconstrained  linearised  Toda  lattice  in  thermodynamic 
limit.  We  explain  the  phrase  'unconstrained'  in  the  following. 


3.  Action-angle  variables 

The  key  to  the  situation  is  that  the  Toda  lattice  has  a  constrained  phase  space  in  terms  of  its 
action-angle  variables  under  vanishing  b.c.s  at  infinity.  (We  have  not  yet  checked  however  that  v/e  can 
exploit  the  c  nnection  between  action-angle  variables  under  periodic  b.c.s  and  those  under  vanishing 
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b.c.s  in  thermodynamic  limit  in  this  case  of  the  Toda  lattice.  Nor  have  we  checked  that  our  result  (10) 
for  the  soliton  density  leads  finally  to  the  free  energy  density  given  by  (7)  and  (9).  Our  calculations 
which  lead  to  (10)  are  therefore  incomplete  in  these  respects.) 

The  essential  of  the  matter  under  vanishing  b.c.s  is  given  in  ref.  [29];  solitons  and.  phonons  are 
coupled  in  terms  of  their  realisable  ‘action-angle  variables’.  The  Hamiltonian  of  the  Toda  lattice  in 
terms  of  these  action-angle  variables  can  be  put  in  the  form  [29] 


H  =  2/ 


sin  k  P{k)  dk  +  i  2  {Zj  ^  -  2y  +  2  In  zj)  , 


y=i 


(12) 


but  the  variables  P{k)  and  are  constrained  by 


(13) 


Thus  there  is  the  alternative  form  for  H  in  these  variables  which  is 


(14) 


The  Zy  are  the  zeros  of  the  ‘transmission  coefficient’  a{z)  inside  the  unit  circle  lz|  =  1  and  describe  the 
solitons:  coefficients  a(z),  b{z)  are  defined  on  |z|  =  1,  z  #  ±1  so  that  |a(z)l^  -  \b{z)\^  =  1  [29].  These 
coefficients  near  z  =  ±1  are  singular  or  regular  together  [29]  so  that  b{z)  and  Zy  are  not  independent. 
This  is  tied  to  the  fact  that  one  half  of  the  infinite  set  of  constants  of  the  motion  are  inadmissible  since 
they  do  not  conserve  the  phase  space.  Consequently  the  ‘natural’  soliton  action-angle  variables 


’  9/  =  ln|y/l. 


(15) 


(for  the  definition  of  jj  see  ref.  [29])  together  with  the  natural  action-angle  variables  for  phonons 
P(k)  =  ^sinkln(l  +  lfe(e''')|'), 

<2(/c)  = -argfi(e'*) ,  0<k<'iT,  (16) 

have  [29]  the  complex  Poisson  brackets 

sin  fc 

{P(k),  Q(k')}  =8(k-k')-  [5(k)  -  Sik  -  ^)] , 

{Pik),  q^)  =  -2  sin  k  73  [S(/c)  -  S(k  -  ir)] , 

{P,>9;}=S,y,  (17) 

while  all  other  pairings  commute.  Evidently  P,  Q  are  not  canonical  action-angle  variables  and  P{k)  and 
<?y  are  coupled. 

With  these  variables  as  ‘best  possible’  action-angle  variables  under  vanishing  b.c.s,  a  situation  we  are 
investigating  further,  one  can  make  the  canonical  transformation  Zy  =  exp(-py),  0<py<'».  Then 
Py  =  2  cosh  py.  Also  ^y  =  (slnh  Py)”'  9y  preserves  the  symplectic  form  and 
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H  =  2jsinkP(k)dA  +  ^  (sinh 2p^  - 2p^) . 


(18) 


This  splits  as  in  (5)  into  phonon  and  soliton  parts.  It  is  important  to  our  result  (10)  that  sinh2p^  -  2pj 
expands  as  +  G{p^j):  H  eq.  (14)  expands  as  2pj  +  0{p^). 


4.  Statistical  mechanics  and  soliton  density 

The  constraint  (13)  on  the  phase  space  coordinatised  through  these  action-angle  variables  P(/c), 
Q{k),  Pj,  need  not  be  implemented  for  the  calculation  of  limjv_„  {N^)I{2N  +  1)  if  we  use  the  soliton 
coordinates  p,,  q^  as  free  coordinates.  The  difficulty  is  thus  pushed  into  the  calculation  of  the  phonon 
number  density  (without  constraints  apparently  undefined  in  thermodynamic  limit  -  see  by  eq.  (9))  and 
the  contribution  of  the  phonons  to  the  total  free  energy  density  which  is  defined  by  (9).  This  procedure 
can  work  because,  with  this  choice  of  variables  and  the  choice  (18)  for  H,  both  the  soliton  number 
density  and  the  soliton  contribution  to  the  free  energy  density,  which  to  leading  order  =  ~  ‘  •  {number 
density)  depend  only  on  the  soliton-soliton  interactions. 

One  can  see  this  by  looking  at  the  classical  generalised  BA,  namely  the  classical  Bethe  equations  for 
the  Toda  lattice.  These  we  find  to  be  [21]  as  follows;  Recall  that,  generically  the  quantum  Bethe 
equations  take  the  form 

giij(2N  +  l)  _  n 

i*i 

so  their  logarithm  is 

No 

{2N  +  1  )k,  =  2t. w,  -  E  A{k„  k^) ,  (20) 

where  the  n,  are  integers  1,2,...,  2N  +  1,  and  the  A  are  quantum  5-matrix  phase  shifts  in  a  fermion 
description.  We  need  A„,2A^+1-^*  in  thermodynamic  limit  with  fV„(2N -I- 1)“*  finite,  for  a  finite 
density  thermodynamic  limit.  Our  result  [21]  for  the  ‘classical’  Bethe  equations  in  corresponding 
logarithmic  form  is  the  set  of  ‘phase  shift  equations' 


{2N  +  \)k,  =  2ir«,  -  |  dit'  A{k„  k')  P{k') 

0 

>=i  i=\ 

{2N  -h  l)p;  =  2iTn;  -  s  (p;.  Pr)  -  2  K'ipJ^  Pi) , 

;=i  >■=1 

Nt  n; 

{2N  +  l)p:  =  2Ttn~  -  s  K^ipi ,  pi)  -  2  4,T(p7 ,pi). 


r=i 


;'=i 


(21a) 

(21b) 


(21c) 
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with  j'  =  j  excluded  in  (21b, c):  n,  and  nf  are  integers  and  0*  means  principal  value.  These  two 
equations  concern  at  their  left  sides  positive  (+i;e)  and  negative  {-ve)  momentum  solitons  respectively. 
There  is  coupling  of  the  two  sorts  of  soliton  to  the  phonons  through  the  phase  shifts  df  (A:,  p)  in  (21a). 
However,  the  soliton  equations  (21b,c)  are  closed  to  soliton-soliton  interactions. 

By  our  generalised  BA  method  [15,16,18,19,21],  this  set  (21)  of  ‘phase  shift  equations’  yields  for  the 
free  energy  density 

f  1  f  1 

2N  +  1  ~  2^/3  J  “  lir/B 

0  0  0 

The  phonon  excitation  energies  €(k)  are  found  from  a  nonlinear  integral  equation  coupling  phonons 
and  both  sorts  of  soliton.  But  the  corresponding  integral  equations  for  (p)  are 


QC  00 


E:(p)-e.(p)-2^{<ip 

0 


0 

with  the  obvious  corresponding  equation  for  E~(p).  These  two  coupled  integral  equations  are  closed  to 
Es(p)- 

In  (23)  £s(p)  =  sinh2p  -2p  and  the  phase  shifts,  which  appear  first  in  (21),  are  [21] 

A(k,  k')  =  {[sin(7A:)cos(^/c')]^  -  [cos(^Ac)sin(^A:')]^}'' , 

(^1  p)-  -2tan  ■‘[tan(^A:)  tanh(^p)] , 
p)  =  2tan'‘[cot({A:)  tanh(jp)] , 

K~iP^  P')  =  2  tanh’'[coth(^p)  tanh(^p')]  =  A;*{p,  p’) , 

^sV(P’P')  =  -2tanh"*[tanh(^p)tanh(^p')]  =  .d“'(p,  p').  (24) 

The  two  soliton  number  densities  are  then  given  by  lim^_„  (Nf )  /(2/V  +  1)  =  Jo  pf(p)  dp  where  pf  (p) 
satisfy 


Ps^(p)e 


PEfip)  _ 


1  1  r ,  .  ^4r(P’P')  ,, 

S - eAp) 


1  '  ^^ss^(P'P') 


Ps(P’)  . 


and  this  system  is  again  closed  to  solitons.  Note  that  E^{p)  solve  (23)  and  its  companion.  This  solution 
is  obtained  by  iteration  (assuming  iteration  is  possible).  Then  (25)  must  be  solved  by  iteration  (assuming 
that  is  possible).  So  there  are  two  iterations.  In  this  way  we  reach  the  formula  (10)  as 


{n:+n:) 

I™  2N  +  1 


J  p,(p)dp  =  -r(|)(^j  -<?(r/ro) 


R.K.  Bullough  el  al.  i  Thermodynamkn  of  Toda  lattice  models:  application  to  DNA 


91 


We  have  managed  to  evaluate  all  but  one  of  the  contributions  to  the  first  correction  to  (26)  which  is 
then  (l/48'rrV3)/3~‘  O  measured  in  {a/ by').  Note  that  the  contribution  to  the  total  free  energy 
density  of  the  constrained  phonon  modes  must  eliminate  the  free  energy  contribution 


n^^2N  +  1 


lim 


{n:+n:) 

2N  +  1 


(27) 


determined  by  (26)  since  this  goes  as  The  result  for  the  total  free  energy  must  then  be  equivalent 
to  the  asymptotic  expansion  (9).  The  problem  of  actually  demonstrating  all  of  this  looks  very  difficult. 

It  would  be  very  interesting  now  to  make  detailed  comparisons  between  numerical  estimates  for  the 
soliton  density  and  the  analytical  theory  as  we  develop  the  further  corrections  to  (26).  These  corrections 
already  tend  to  explain  the  turn-over  of  the  numerical  log-log  plots  at  higher  temperatures  but,  as 
noted,  the  numerical  calculations  found  by  the  second  method  in  ref.  [2]  are  very  far  from  the 
thermodynamic  limit  used  in  our  calculations.  Of  course  it  will  also  be  instructive  to  calculate  the  free 
energies  numerically  by  an  adaptation  of  the  second  method  of  ref.  [2]  if  this  can  be  devised. 


Conclusion:  The  formula  (10)  for  the  soliton  number  density  of  the  classical  Toda  lattice  in 
thermodynamic  limit  seems  to  be  a  good  estimate  for  small  enough  temperatures  T  in  the  range  for 
which  classical  statistical  mechanics  applies.  It  is  not  incompatible,  apparently,  with  the  free  energy 
result  at  constant  volume  and  this  result  is  in  agreement  with  Toda’s  free  energy  [3]  taken  at  constant 
pressure. 

We  have  still  to  check  that  our  use  of  the  constrained  ‘action-angle  variables’  (15)  and  (16)  for  the 
thermodynamic  limit  is  compatible  with  the  periodic  b.c.s  initially  used  for  that  thermodynamic  limit,  as 
it  is  for  the  other  models  concerned  in  ref.  19.  The  numerical  results  [1,2]  nevertheless  seem  to  provide 
yet  another  check  on  our  methods,  and  vice-versa.  There  exist  [15]  already  other  good  checks  on  these 
methods. 

Evidently  this  work  [1,2]  by  Alwyn  Scott  and  his  collaborators  is  yet  once  again  of  considerable 
interest  and  importance. 
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The  effect  of  nonlinearity  in  models  of  DNA  with  homogeneous  and  inhomogeneous  strands  is  investigated.  Numbers 
of  solitons  and  numbers  of  long-lived  open  states  as  function  of  temperature  are  computed. 


A  number  of  researchers  have  suggested  that 
anharmonicity  may  play  a  role  in  the  dynamics  of 
deoxyribonucleic  acid  (DNA)  molecules  at  phys¬ 
iological  temperatures.  A  summary  of  this  con¬ 
troversial  question  with  an  extensive  list  of  refer¬ 
ences  was  given  in  1990  in  [1].  At  the  same  time 
studies  of  a  dynamical  model  in  which  the  hydro¬ 
gen  bonds  in  the  base  pairs  of  the  DNA  mole¬ 
cule  were  modelled  by  nonlinear  springs,  gov¬ 
erned  by  Morse  potentials,  were  presented  in 
[2,3].  A  multiple  scale  perturbation  analysis 
leads  to  a  continuum  model  approximation, 
based  on  nearest  neighbors’  interactions,  in 
terms  of  the  nonlinear  Schrodinger  equation. 
Inclusion  of  long  range  linear  interactions,  how¬ 
ever,  tends  to  disperse  pulses  propagating  along 
the  strands  of  the  DNA  molecule  [4].  In  the 
DNA  molecule  there  is  a  succession  of 
adenosine-thymine  (A-T)  and  guanine-cytosine 
(G-C)  base  pairs.  The  A-T  bond  involves  two 
hydrogen  bonds  whereas  the  G-C  bond  involves 
three  hydrogen  bonds.  The  influence  of  this  type 
of  inhomogeneity  on  the  localization  and  propa¬ 
gation  of  energy  along  the  DNA  strand  was 
investigated  in  [5]. 

The  temperature  problem  for  the  nonlinear 
DNA  model  with  a  transversal  Morse  potential 
was  treated  in  [2]  by  statistical  mechanics  meth¬ 
ods.  In  [6,1]  the  DNA  molecule  is  modelled  by  a 
Toda  lattice  in  which  each  particle  represents  a 


single  base  pair  and  the  nonlinear  springs  con¬ 
necting  neighbouring  particles  represent  the  Van 
der  Waals  potentials  between  adjacent  pairs.  The 
system  was  thermalized  either  by  choosing  the 
initial  conditions  from  Gaussian  random  dis¬ 
tributions  [6]  or  by  adding  a  damping  force  and  a 
noise  force  which  simulate  the  interaction  with  a 
thermal  reservoir  at  finite  temperature,  T  [1]. 
The  resulting  Langevin  equations  are  then  per¬ 
turbed  Toda  equations  for  each  particle. 

Due  to  the  integrability  of  the  Toda  lattice,  it 
becomes  possible  to  measure  the  effect  of  the 
nonlinearity  in  a  very  concrete  manner,  namely 
by  using  the  machinery  of  the  inverse  scattering 
transform  to  count  the  number  of  solitons  e.g. 

Numerical  integration  of  the  perturbed  Toda 
equations  s^.d  subsequent  application  of  the  sol- 
iton  counter  [1]  shows  that,  when  thermal 
equilibrium  is  reached,  a  significant  number  of 
solitons,  (Vj,  is  generated  in  the  DNA  model  at 
physiological  temperatures,  namely  N^—0.31N, 
where  N  is  the  number  of  base  pairs.  Moreover, 
from  dynamical  simulations  at  different  tempera¬ 
tures  it  was  found  that  A, «  7*'^.  This  power  law 
is  observed  independently  of  the  coefficient  used 
in  the  friction  force. 

In  [7]  this  temperature  dependence  was  de¬ 
rived  by  statistical  mechanics  for  a  free  soliton 
gas.  A  linear  dependence  on  temperature  for  low 
temperatures  was  found  in  [8,9]  considering  the 
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density  of  states.  Using  the  interacting  soliton 
phonon  theory  a  T''^  law  was  found  in  [10,11]. 
However,  the  difficulty  of  physically  distinguish¬ 
ing  low-energy  Toda  solitons  from  long-wave 
length  phonons  is  underlined  in  [12].  Further 
work  based  on  the  Bethe  ansatz  is  reported  in 
[13,14].  Recent  analytical  work,  [15,16],  arrives 
at  a  law  for  the  soliton  density. 

In  order  to  make  a  more  realistic  model  of  the 
DNA  dynamics  the  molecule  must  be  described 
by  two  chains,  transversely  coupled,  where  each 
chain  simulates  one  of  the  two  polynucleotide 
strands.  The  hydrogen  bond  between  the  two 
bases  in  a  pair  can  be  modelled  by  a  Lennard- 
Jones  potential  as  suggested  in  [17, 18].  In  [19] 
this  system  was  thermalized  by  inclusion  of 
damping  and  noise  forces.  The  masses,  repre¬ 
senting  the  bases,  were  permitted  to  perform 
longitudinal  as  well  as  transversal  motions.  Al¬ 
ready,  an  unperturbed  single  Toda  chain  in 
which  the  masses  can  move  in  this  manner,  is  not 
an  integrable  system.  However,  in  the  case 
where  the  longitudinal  and  transversal  strains  are 
of  the  same  order  of  magnitude,  a  supersonic 
solitary  hybrid  wave  can  propagate  for  a  while 
without  dispersing,  if  the  velocity  and  the  param¬ 
eter,  describing  the  coupling  between  the  longi¬ 
tudinal  and  the  transversal  modes,  are  small  [20]. 
Correspondingly,  in  a  continuum  limit  based  on 
equations  of  improved  Boussinesq  type  analyti¬ 
cal  solutions,  travelling  with  a  characteristic  ve¬ 
locity,  exhibit  stable  behaviour  or  blow-up  (of 
the  longitudinal  component)  [21].  Other  super¬ 
sonic  analytical  solutions,  travelling  with  arbi¬ 
trary  velocity  [22],  can  be  shown  to  be  un¬ 
bounded.  The  existence  of  blow-up  has  recently 
been  demonstrated  by  analytical  methods  [23]. 

The  influence  of  the  specific  base  sequences 
along  the  helix  has  been  investigated  in  a  num¬ 
ber  of  papers.  Here  we  mention  a  recent  study  of 
DNA-promoter  dynamics  based  on  a  discrete 
sine-Gordon  model  [24].  Propagation  and  pin¬ 
ning  of  solitons  were  found  in  various  segments 
of  the  inhomogeneous  double  strand. 

In  the  present  work  we  consider  the  influence 


of  the  inhomogeneity  on  the  dynamics  of  the 
DNA  helix  described  by  the  Toda-Lennard- 
Jones  model.  The  inhomogeneous  Toda- 
Lennard-Jones  model  is  a  more  realistic  descrip¬ 
tion  of  the  DNA  helix  since  the  two  strands  of 
the  helix  are  described  by  two  chains  transverse¬ 
ly  coupled  and  the  inhomogeneity  of  the  mole¬ 
cule  is  taken  into  account.  As  in  [19],  the  anhar- 
monic  Van  der  Waals  potentials  are  described  by 
a  Toda  potential  with  lattice  constant  /: 

Vr(l)=-^exp(-b„l)  +  a/,  (1) 

n 

and  the  hydrogen  bonds  are  modelled  by  a  Len- 
nard- Jones  potential 

V'uW  =  44(^)"-(5)‘].  (2) 

The  index  n  appears  now  explicitly  to  indicate 
the  presence  of  inhomogeneity.  Denoting  the 
weak  A-T  base  pairs  by  W,  the  strong  G-C  base 
pairs  by  S,  and  an  averaged  base  pair  by  H,  the 
presence  of  inhomogeneities  will  be  described 
using  different  values  for  the  linear  spring  con¬ 
stant,  k„  =  ab„,  in  the  Toda  potential,  and  for 
the  Lennard-Jones  parameter,  e„,  related  to  the 
stacking  energy  and  the  hydrogen  bond  strength, 
respectively.  DNA  sequences  of  the  following 
type  have  been  considered: 


where  X  stands  for  W,  S,  or  H.  In  the  Toda- 
Lennard-Jones  model  [19,25,26]  the  linear  spring 
constants  in  the  Toda  potential  between  W  and 
S,  W  and  W,  S  and  S,  and  H  and  H  are  (a) 
A:w_s  =  1  M/m,  =  A:s_s  =29N/m,  and 

=  15.8  N/m  and  (b)  =  0.1  N/m, 

=  k^_^  =  29.9  N/m  and  “  15.8  N/m, 
respectively.  In  the  Lennard-Jones  potential  for 
W,  S,  and  H  we  use  e  =  =  0.0176  eV,  e  =  = 

0.0264  eV,  e  =  €„=  0.022  eV,  and  <r  =  2.7x 
10”‘°  m  in  case  (a)  and  (b). 

In  fig.  1  the  number  of  solitons  as  function  of 
temperature  is  shown  for  the  cases  of  inhomoge- 
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Fig.  1.  Number  of  solitons,  N,,  versus  temperature,  T,  in  '''■1 

logarithmic  scale,  for  N  =  64  masses.  Circle;  single  strand  F'g-  2.  Number  of  open  states  of  hydrogen  bonds  versus 

model  [6,1].  Bullet;  homogeneous  double  strand  model  temperature,  T,  with  life-times;  5-20psec  (open  symbols), 

(e=0.022eV,  /!c  =  15.8N/m,  o- =  2.7  x  10‘‘®m).  Diamond;  longer  than  20ps  (full  symbols).  Homogeneous  chain 

inhomogeneous  double  strand  model  for  parameter  values  of  (circles),  inhomogeneous  chain  with  prevalence  of  bases  of 

case  (a)  (empty  diamond)  and  case  (b)  (full  diamond).  strong  type  (triangles),  homogeneous  type  (diamonds),  and 

weak  type  (squares). 


neous  chains  with  prevalence  of  H  type  (X  =  H) 
for  parameter  values  of  case  (a)  (empty 
diamond)  and  case  (b)  (full  diamond),  respec¬ 
tively.  This  soliton  ratio  is  compared  with  the 
results  of  the  homogeneous  double  strand  model 
(e  =  0.022  eV,  A;=15.8N/m,  and  o-  =  2.7x 
10  "‘“m)  (bullet)  and  with  the  ones  of  the 
homogeneous  unperturbed  single  strand  model 
(circle)  [19].  The  effect  of  the  nonlinearity  is 
stronger  in  the  double  strand  model  than  in  the 
single  strand  model.  Thus  NJ N ^0A9,  0.55, 
0.45,  and  0.31,  respectively,  at  physiological 
temperature.  Results  very  similar  are  obtained 
for  chains  with  prevalence  of  strong  or  weak 
base  pairs. 

In  fig.  2  we  show  the  number  of  open  states  of 
the  hydrogen  bond  as  function  of  temperature, 
for  parameter  values  of  case  (a).  Case  (b)  is 
considered  in  [26].  Open  symbols  refer  to  life¬ 
times  5-20  psec,  while  full  symbols  correspond 
to  lifetimes  longer  than  20  psec.  It  is  seen  that 
the  presence  of  inhomogeneities  clearly  enhances 
the  hydrogen  bond  breaking.  When  bonds  of 
weak  type,  W,  prevail  most  open  states  are 
found.  Thus  at  physiological  temperatures  the 
thermal  fluctuations  of  the  surrounding  medium 
may  cause  the  hydrogen  bond  breaking  with  a 


significant  lifetime.  However,  this  does  not  imply 
that  all  bonds  are  permanently  broken.  Such  a 
state  of  full  denaturation  may  be  caused  by  other 
effects  such  as  bending  of  DNA  [27,28]. 
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100.310-E0%/91  of  the  University  of  Basque 
Country  is  acknowledged. 


References 


[IJ  V.  Muto,  A.C.  Scott  and  P.L.  Christiansen,  Physica  D 
44  (1990)  75. 

(2)  M.  Peyrard  and  A.R.  Bishop,  Phys.  Rev.  Lett.  62 

(1989)  2755.  > 

[3]  M.  Techera,  L.L.  Daemen  and  E.W.  Prohofsky,  Phys. 
Rev.  A  40  (1989)  6636. 

(4J  M.  Techera,  L.L.  Daemen  and  E.W.  Prohofsky,  Phys. 
Rev.  A  41  (1990)  4543. 

[5]  M.  Techera,  L.L.  Daemen  and  E.W.  Prohofsky,  Phys. 
Rev.  A  42  (1990)  1008. 

[6]  V.  Muto,  A.C.  S«)tt  and  P.L.  Christiansen,  Phys.  Lett. 
A  136  (1989)  33. 

[7]  T.  Schneider  and  E.  Stoll,  Classical  Statistical  Mech¬ 
anics  of  the  Toda  Lattice;  Static  and  Dynamics  Prop¬ 
erties,  private  unpublished  communication  (1980). 


96 


P.L.  Christiansen,  V.  Muto  /  Nonlinear  models  of  DN A  dynamics 


(8|  F.  Mertens  and  H.  Buttner,  Phys.  Lett.  A  84  (1981) 
155. 

(9)  F.  Mertens  and  H.  Buttner,  J.  Phys.  A  15  (1982)  1831. 

[loj  F.  Yoshida  and  T.  Sakuma,  Phys.  Rev.  A  25  (1982) 
2750. 

(llj  N.  Theodorakopoulos,  Counting  Solitons  and  Phonons 
in  the  Toda  Lattics,  in:  Statics  and  Dynamics  for  Non¬ 
linear  Systems,  eds.  G.  Benedek  et  al.  (Springer,  1983) 
p.  271. 

(12)  N.  Theodorakopoulos,  Phys.  Rev.  Lett.  53  (1984)  871. 

(131  M.  Opper,  Phys.  Lett.  A  112  (1985)  201. 

[14]  K.  Sasaki,  Phys.  Rev.  B  35  (1987)  3445. 

[15]  F.  Marchesoni  and  C.  Lucheroni,  Phys.  Rev.  B  44 
(1991)  5303. 

[16]  R.K.  Bullough.  Y.  Chen  and  J.T.  Timonen,  Physica  D 
(1993). 

[17]  P.  Perez  and  N.  Theodorakopoulos,  Phys.  Lett.  A  124 
(1987)  267. 

[18]  J.  Halding  and  P.S.  Lomdahl,  Phys.  Lett.  A  124  (1987) 
37. 


[19]  V.  Muto,  P.S.  Lomdahl  and  P.L.  Christiansen,  Phys. 
Rev.  A  42  (1990)  7452. 

[20]  P.L.  Christiansen,  P.S.  Lomdahl  and  V.  Muto.  Non- 
lineaiity  4  (1990)  477. 

[21]  P.L.  Christiansen,  S.  Rionero,  P.L.  Christiansen  and  V. 
Muto.  Stability  of  a  Solitary  Wave  Solution  to  a  Set  of 
Coupled  Boussinesq-like  Equations,  in:  Waves  and 
Stability  in  Continuous  Media,  ed.  S.  Rionero  (World 
Scientific,  1991)  p.  49. 

[22]  P.L.  Christiansen.  V.  Muto  and  S.  Rionero,  Chaos, 
Solitons  and  Fractals  2  (1992)  45. 

[23]  S.K.  Turitsyn,  Phys.  Rev.  E  47  (1993). 

[24]  M.  Salerno,  Phys.  Rev.  A  44  (1991)  5292. 

[25]  V.  Muto,  Nanobiology  1  (1992)  325. 

[26]  V.  Muto,  F.  Vadillo  and  M.  Lezaun,  to  appear. 

[27]  S.  Swaminattan,  G.  Ravishanker  and  D.L.  Beveridge,  J. 
Am.  Chem.  Soc.  113  (1991)  5027. 

[28]  J.M.  Withka.  S.  Swaminattan,  J.  Srinivasan,  D.L.  Be¬ 
veridge  and  P.H.  Bolton,  Science  255  (1992)  597. 


Physica  D  68  (1993)  97-103 
North-HoUand 

SDl:  0167-2789(93)E0173-9 


Nonlinear  science:  toward  the  next  frontiers 

James  A.  Krumhansi' 

Physics  Department,  University  of  Massachusetts,  Amherst,  MA  01002,  USA 


Nonlinear  science  has  erupted  in  many  directions  over  recent  years,  vvith  many  successes.  Two  main  themes  have  been 
found  in  many  different  settings  -  chaos  and  solitary  phenomena.  While  found  initially  in  simple  models,  useful  for 
establishing  mathematical  methods  and  the  behavior  of  exact  limiting  cases,  their  robust  features  give  us  assurance  that 
they  are  generic.  The  next  frontier  then  is  not  so  much  in  formalism,  but  in  how  to  achieve  a  realistic,  eventually 
utilitarian,  representation  of  an  actual  biological,  material,  physical,  or  chemical  system  that  connects  the  experimentally 
significant  behavior  with  its  nonlinear  properties.  An  approach  is  proposed. 


1.  Introduction 

The  thoughts  presented  here  are  an  outgrowth 
of  a  workshop  on  Future  Directions  of  Nonlinear 
Dynamics  in  Physical  and  Biophysical  Systems  at 
Lyngby,  Denmark,  July  1992.  There  is  no  doubt 
in  my  mind  that  there  is  an  important  place  for 
the  widespread  application  of  the  great  amount 
that  has  been  learned  about  nonlinearity,  but  I 
have  developed  concerns  whether  the  research  as 
it  is  now  going  will  succeed  in  the  broad  sense 
hoped  for. 

Over  the  past  twenty  five  years  nonlinear  sci¬ 
ence  has  flowered,  in  mathematics,  the  physical 
sciences,  and  engineering.  Bolstered  by  the  great 
power  of  computer  simulation,  formal  theories 
and  the  experimental  identification  of  nonlinear 
phenomena  have  made  great  progress.  The 
generic  character  of  ehaos  and  solitary  exeita- 
tions  have  been  established;  general  mathemati¬ 
cal  methods,  ever  increasing  sophistication  of 
computer  simulations,  and  powerful  new  ex¬ 
perimental  techniques,  particularly  in  traditional 
areas  of  physics  (e.g.  fluid  dynamics)  have 
achieved  marked  success. 
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In  fields  not  so  traditional,  such  as  biological 
science,  metallurgy,  and  others,  particularly 
where  mathematics  has  been  less  used,  the  rec¬ 
ognition  and  incorporation  of  modern  nonlinear 
concepts  and  methods  have  not  progressed  very 
far.  This  is  not  only  unfortunate,  but  unneces¬ 
sary.  The  challenge  of  the  next  era  in  nonlinear 
science  is  to  find  what  is  necessary  to  close  the 
gap  that  exists. 

The  root  of  the  problem  as  I  see  it,  exagger¬ 
ated  perhaps,  is  that  more  and  more  nonlinear 
research  is  becoming  either  marginal  or  irrele¬ 
vant.  aided  and  abetted  by  the  wide  availability 
of  larger  and  larger  computers,  and  the  ease  of 
formulating  variations  on  a  basic  mathematical 
theme  and  doing  one  more  case.  Indeed,  many 
of  these  incremental  exploratims  yield  fascinat¬ 
ing  special  features.  However,  the  important 
questions  are:  first,  do  they  extend  our  general 
understanding;  or,  alternatively,  do  the  special 
features  really  provide  new,  quantitative  insight 
to  some  particular  experimental  observation?  It’s 
not  clear  in  many  instances  of  published  research 
today  whether  the  answer  to  either  is  in  the 
affirmative.  Nor  do  I  mean  to  direct  my  skepti¬ 
cism  only  to  nonlinear  science,  though  the  issue 
seems  to  me  to  be  of  especial  importance  to  a 
developing  science  looking  for  opportunities  on 
the  widest  possible  front. 
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2.  Critkism 

Let  me  illustrate  two  sources  of  my  concerns, 
which  I  will  call  marginal  modeling,  and  science 
by  advertise  tuent.  To  convey  what  I  mean  by 
each  I  would  like  to  review  some  history  in  my 
own  held,  condensed  matter  physics. 

One  of  the  great  revelations  following  the 
discovery  of  quantum  mechanics  was  the  applica¬ 
tion  of  the  theory  to  periodic  crystals  and  the 
discovery  of  electronic  energy  bands,  developed 
especially  by  Bloch,  Bethe,  Slater  and  others  in 
the  1920-30S.  For  a  number  of  years  thereafter 
there  were  explorations  of  all  sorts  of  models, 
periodic  square  well  potentials,  multiple  sine 
wave  potentials,  mufhn  tin  potentials,  etc.,  but 
looked  at  objectively  it  cannot  be  said  that  any 
of  these  studies  added  very  much  substantively 
new  physics.  These  were  “marginal  models”, 
albeit  some  quite  interesting  as  puzzles  at  the 
time.  There  was  a  resurgence  of  this  topic  in  the 
19S0-60s,  but  with  new  justihcation  as  will  be 
discussed  below.  A  similar  situation  is  develop¬ 
ing  in  certain  areas  of  nonlinear  science  today, 
particularly  as  new  supercomputers  allow  the 
exploration  of  more  and  more  complex  model 
problems;  but  only  models  they  are,  for  the  most 
part.  There  are  notable  exceptions,  as  in  hydro¬ 
dynamics  where  exploration  of  singular  prop¬ 
erties,  local  structures,  and  turbulence  has  main¬ 
tained  close  and  faithful  contact  with  the  physics. 

Certainly,  there  is  a  time  in  the  development 
of  a  new  topic  when  simple,  testable  models  are 
of  great  importance  in  reconnoitering  general 
behavior.  However,  as  progress  provides  a 
generic  understanding,  models  should  make  way 
for  reality;  the  statistical  mechanics  of  one  more 
elaboration  of  a  nonlinear  potential  on  a  one 
dimensional  chain  ought  to  be  explored  only  if 
there  is  a  specific  suggestion  from  experiment  or 
other  physical  reason  to  test  that  particular  vari¬ 
ation  on  the  general  theme.  Indeed,  there  are  in 
the  literature  many  extremely  sophisticated  non¬ 
linear  simulations  which  have  been  carried  out, 
that  show  interesting  behaviors;  but  little  effort 


has  been  ex(>ended  to  seek  out  substantively 
their  presence  or  absence  in  situations  they  al¬ 
legedly  represented.  This  is  what  I  call  science  by 
advertisement. 

Again,  in  my  own  held,  to  take  an  example, 
there  have  been  a  series  of  recent,  admittedly 
titillating,  reports  on  stochastic  lattice  models  of 
the  spread  of  forest  hres.  Indeed  the  dynamics  of 
the  models  presents  neat  problems  and  behavior, 
which  might  also  apply  to  corrosion,  spread  of 
surface  infection,  and  so  forth  -  might,  that  is.  It 
is  curious,  however,  that  in  the  bibliographies  of 
the  papers  I  have  seen  so  far  the  only  references 
are  to  the  physics  or  mathematics  literature,  not 
to  forestry  publications.  Whether  or  not  the 
models  are  indeed  applicable  to  real  forest  hres, 
it  is  all  too  easy  to  entitle  the  research  as  such. 
The  same  criticism  can  be  applied  to  many  of  the 
current  nonlinear  dynamical  models  of  the  con¬ 
formations  of  biomolecules;  they  are  biology  by 
advertisement. 

If  the  only  message  I  had  were  negative,  it 
would  be  unfortunate.  That  is  not  the  case, 
however.  The  viewpoint  to  take  is  that  nonlinear 
science  has  come  a  long  way,  and  likely  can 
make  many  further,  important  contributions  to 
other  fields,  but  it  will  not  unless  the  gap  be¬ 
tween  formal  theories  or  simulations  and  the  real 
scientific  problems  in  those  other  fields  is  nar¬ 
rowed.  Here  are  some  suggestions,  with  guid¬ 
ance  from  history. 

3.  New  frontiers;  bask  rules 

Many  challenging  nonlinear  problems  arose 
originally  from  the  observation  of  some  natural 
phenomenon:  limit  cycles  in  nonlinear  coupled 
differential  equations  from  modeling  competing 
fish  populations  in  the  Baltic  Sea,  chaos  from 
modeling  weather  dynamics.  In  the  simplest 
cases  the  modeling  format  may  be  apparent  from 
the  nature  of  the  phenomenon;  that,  however,  is 
usually  not  the  case.  Assuming  that  the  objective 
is  to  have  nonlinear  dynamics  describe  reality,  1 
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jpose  the  following  check  list  for  developing 
Miels: 

U)  Identify  the  phenomenon  to  be  studied. 
)  so  only  after  consulting  with  experimentalists 
find  out  what  they  believe  to  be  important, 
itially,  what  we  think  is  important  may  not  be 
evant;  try  to  do  their  problem  first.  Reduc- 
nism  from  “first  principles”  only  hardly  ever 
•rks. 

^2)  Use  physically  realistic  variables,  i.e. 
rder  parameters”.  The  endemic  difficulty  here 
that  formal  mathematical  or  theoretical  sci- 
tists  expect  experimental  scientists  to  pull  out 
general  models  the  important  results  that 
ate  to  their  need.  This  is  unrealistic;  the  inter¬ 
ion  must  be  a  two  way  street.  The  nonlinear 
entist  must  make  the  effort  to  learn  enough 
out  biology,  metallurgy,  quantum  electronics, 
whatever  the  topic  may  be  both  to  talk  to  the 
oerimentalist  and  to  use  common  sense  in 
Mjeling.  Clearly,  it  is  important  to  find  the 
nimum  number  of  functionally  important 
ordinates,  but  not  more  -  a  clever  trick. 

'3)  Check  frequently  in  the  carrying  out  of 
lulations  whether  it  is  better  to  reconsider  the 
)deling  of  the  science  or  to  increase  the  detail 
e.  size)  of  simulation  as  the  research  goes 
ward. 

Let  me  cite  a  few  instances  by  way  of  general 
istration.  then  speak  more  specifically  to  some 
)molecular  examples.  Earlier  1  noted  the  his- 
■y  of  electronic  energy  band  physics,  and  that 
er  an  early  period  developing  the  foundations 
:re  was  a  hyatis  of  sorts,  until  the  1950-60s. 
hat  then  happened  was  the  sudden  entry  of 
niconductor  physics  and  its  application  to  elec- 
•nic  devices;  the  science  then  became  focussed 
a  few  very  specific  questions,  i.e.  (1)  above, 
d  energy  band  theory  had  a  renaissance, 
ose  in  the  field  we  recall  that  then,  instead  of 
possible  aspects  of  electron  dynamics  being 
)deled  by  theory,  process  (2)  took  place;  elec- 
;al  properties  were  well  described  by  two 
uids”,  electrons  and  holes,  with  a  few  aniso- 
ipic  effective  mass  parameters.  Meanwhile 


process  (3)  took  place  continually,  refining  the 
modeling,  the  questions  asked  of  the  simula¬ 
tions,  and  the  interpretation  of  results. 

What  about  nonlinear  dynamics  these  days? 
There  is  a  spectrum  of  answers,  depending  on 
the  area  of  application.  I  have  the  impression, 
that  in  the  field  of  hydrodynamics  and  plasma 
physics,  at  least,  that  the  three  rules  above  are 
employed  very  well;  this  may  not  be  too  surpris¬ 
ing  since  the  basic  physical  variables  are  easily 
identified,  and  the  elementary  laws  governing 
them  are  well  known.  In  this  case  reproducible, 
physically  significant  phenomena,  including  col¬ 
lective  and  local  nonlinear  behaviors,  are  found 
and  correlated  to  theory.  An  excellent  overview 
of  this  area  of  nonlinear  science  is  given  recently 
by  Rabinovich,  Fabrikant,  and  Tsimring  [1]. 

The  situation  in  other  areas  is  not  nearly  as 
well  defined;  one  close  to  my  own  interests  for 
the  past  decade  has  been  that  of  topological 
solitons  to  describe  transitional  behavior  in  dis- 
placive  structural  transformations  of  solid  ma¬ 
terials,  of  which  martensitic  materials  (hardened 
steels,  shape  memory  alloys)  are  a  particularly 
interesting  subclass.  The  early  simulations  were 
done  with  ball  and  spring  models,  but  the  subject 
has  progressed  much  further  in  quantitative  rele¬ 
vance  to  reality,  with  the  participation  of  metal¬ 
lurgists,  and  through  the  efforts  of  a  number  of 
experimental  and  theoretical  researchers  focus¬ 
sed  on  common  problems  [2,3].  The  rules  stated 
above  now  apply  there.  Nonetheless,  in  spite  of 
this  progress  toward  joining  nonlinear  and  ma¬ 
terials  science,  much  work  continues  in  the  ball 
and  spring  vein,  involving  increasingly  extensive 
simulations.  Let  me  move  on,  however,  to  the 
much  less  clear  case  of  biomolecular  nonlinear 
science. 


4.  Biomolecular  conformations;  proteins  and 
DNA 

Biology  is  concerned  with  structure  at  several 
different  levels;  primary,  the  local  molecular 
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bonding;  secondary,  the  extended  macro-molec¬ 
ular  geome  ry,  as  in  proteins  and  DNA;  tertiary, 
as  in  functional  units  such  as  hemoglobin;  and 
quaternary,  such  as  chromosomes  and  larger 
units.  Of  course  beyond  that  is  cellular  biology, 
and  so  forth.  Pause  for  a  moment  to  compare 
this  with  the  situation  in  fluid  dynamics,  which 
does  not  have  these  distinct  levels  of  mesostruc- 
ture.  There  is  plenty  of  evidence  suggesting  non¬ 
linear  behavior  at  most  of  these  levels,  but  it 
must  be  obvious  that  the  issues  and  the  physical 
phenomena  involved  are  different  between  each; 
biology  is  far  more  complex  than  fluid  dynamics. 
It  is  of  the  essence,  then,  to  select  problems 
carefully. 

For  example,  the  subject  of  energy  transport 
in  proteins  is  important  biologically.  Alwyn  Scott 
[4],  Davydov  [5],  and  others  have  in  effect  fol¬ 
lowed  rule  (1)  by  choosing  the  particular  prob¬ 
lem  of  energy  transport  in  alpha-helical  proteins, 
and  rule  (2)  by  choosing  the  amide  bond  vib- 
ronic  excitation  as  the  primary  physical  vari¬ 
ables.  Rule  (3)  has  fallen  into  place  rather  natur¬ 
ally  as  a  number  of  experiments  stimulated  by 
the  proposition  of  vibronic  solitons  were  carried 
out  over  the  past  decade.  Although  open  ques¬ 
tions  remain,  even  about  the  necessary  quantum 
formulation  of  the  nonlinear  dynamics,  much 
contact  has  been  made  with  biologists. 

The  question  of  protein  folding  conformations 
is  an  entirely  different  matter,  of  paramount 
importance  in  molecular  biology,  and  many  as¬ 
pects  remain  open.  Clearly  the  physical  variables 
involved,  the  equilibrium  criteria,  and  the  dy¬ 
namics  are  fundamentally  different  from  those 
for  the  energy  transport  problem;  but  again 
there  are  clear  signs  of  important  nonlinear  be¬ 
havior  to  be  explained.  A  very  useful  intro¬ 
duction  to  the  structure  and  dynamics  of  bio¬ 
logical  macromolecules  is  the  book  by  McCam- 
mon  and  Harvey  [6j;  a  very  recent  quantitative 
study  of  protein  dynamics,  and  the  development 
of  optimally  descriptive  coordinates,  has  been 
reported  by  Garcia  [7]  on  the  hydrated  protein, 
crambin. 


Beyond  proteins,  however,  the  beauty  and  the 
challenge  of  the  concepts  involved  in  genetic 
expression  by  the  polynucleic  acids  DNA  and 
RNA  have  captivated  our  scientific  interests,  no 
matter  what  field  we  are  in.  It  is  only  natural  that 
we  ask  the  question  whether  nonlinear  dynamics 
plays  a  role  in  the  expression  process.  A  priori 
there  is  no  particular  reason  that  this  be  the  case. 
In  the  context  of  biochemistry  the  triumph  of 
Watson,  Crick  and  Wilkins  [8]  was  to  combine 
necessary  capabilities  from  chemistry,  physics, 
and  biology  in  the  double-helical  (phosphate- 
ribose  sugar)  ladder  with  (purine-pyrimidine) 
base  pair  rungs,  appropriatedly  hydrated.  All  of 
the  pieces  were  necessary  to  simultaneously  com¬ 
bine  the  stability,  the  ability  to  store  the  genetic 
code,  and  the  capability  with  the  proper  opening 
of  the  helix  tr  transfer  a  chemical  copy  of  the 
code  to  produce  selected  proteins,  i.e.  expres¬ 
sion.  The  combined  biological  and  biochemical 
behavior  has  been  established  in  great  detail. 
The  issue  here  is  what  the  kinetics  or  dynamics 
of  the  expression  process  might  be,  for  that  is 
where  nonlinear  dynamics  might  play  a  role. 
Now,  returning  to  the  earlier  discussion,  we  must 
face  the  issue  of  how  and  what  to  model  realisti¬ 
cally. 

There  are  two  aspects  of  the  expression  pro¬ 
cess  that  one  might  focus  upon:  first,  that  base 
pair  breaking  and  opening  of  the  helix  to  expose 
the  code  sequence  occurs  in  a  localized  region, 
suggesting  a  solitary  localized  concentration  of 
strain  energy;  second,  because  it  has  been  noted 
experimentally  that  there  are  enzymes  which 
seem  to  promote  at  one  location  on  the  helix  to 
induce  transcription  at  another  location,  a  local¬ 
ized  excitation  of  some  sort  might  propagate 
from  one  region  to  the  other.  In  either  case  some 
sort  of  solitary  excitation  might  exist. 

On  the  other  hand  the  entire  process  may  be  a 
series  of  chemical  steps,  localized  chemical  re¬ 
actions  together  with  diffusion  along  the  helix 
between  steps.  Clearly,  given  the  complexity  of 
the  DNA  helix  as  a  dynamical  system  it  is  quite 
beyond  reason  to  think  of  doing  any  formal 
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neral  analysis  of  the  dynamics  seeking  some 
jcial  excitations  which  play  a  key  role  in  bio- 
>ical  function.  There  are  only  two  choices:  to 
lort  to  experiment  for  guidance  toward  im- 
rtant  motions,  or  to  attempt  complete  molecu- 
dynamic  simulations  starting  with  molecular/ 
)mic  interaction  potentials  -  or,  better,  to  use 
brmation  from  both.  But  to  proceed  it  is 
iential  to  see  the  molecular  structure  of  a 
'lA  helix. 

However,  before  going  directly  to  a  possible 
ideling  it  is  necessary  again  to  aquaint  mathe- 
iticians  and  physicists  with  some  parts  of  the 
ig  history  of  chemical  molecular  dynamics, 
pically,  in  simple  solids  the  atoms,  being 
tsely  packed,  can  only  change  their  interaction 
ergy  by  small  motions  which  change  the  dis- 
ice  between  them.  Thus,  potential  energy 
ictions  in  the  form  of  two-center  Hooke’s  law 
rings  are  not  a  bad  approximation.  Literally 
jusands  of  simulations  have  been  done  with 
«ss  and  spring  models  in  this  vein.  Unfor- 
lately  these  are  almost  completely  irrelevant 
many  situations  in  real  materials,  and  par- 
ularly  to  biological  systems;  why?  Almost  all 
^logical  material  is  polymeric,  covalently  (di- 
:tionally)  bonded,  and  with  open  structures, 
is  now  allows  a  number  of  additional  degrees 
freedom,  generally  of  much  lower  excitation 
;rgy  than  bond  stretching,  to  come  into  play, 
:h  as  bond  bending  and  twisting  about  co- 
ent  bonds.  Such  distortions  are  far  easier  to 
rite  than  bond  stretching,  typically  1-2  orders 
magnitude  lower  in  energy.  In  fact,  the  poten- 
Is  needed  for  biomolecules  are  of  the  form  [6]; 

=  K,(b-b,f 

bondfi 

+  E  K,ie  - 

bond  angles 

+  5  2  /^,i,[l  +  cos(n<^  -  S)] 

dihedral  angles 

+  2  (4-^  +  ^). 

nonbonded  V  T  Lfr  / 

pairs 


In  order,  the  terms  are,  respectively,  bond 
streching  (Hooke),  bond  bending,  bond  twisting, 
core  repulsion.  Van  der  Waals,  and  Coulombic, 
since  many  ions  are  charged.  Biological  mole¬ 
cules  must  operate  at  temperatures  of  order 
300  K,  for  which  thermal  energy  is  nowhere  near 
enough  to  break  bonds  (therefore  base  pair 
breaking  simply  by  dynamical  excitations  is  hard¬ 
ly  likely),  but  bending  and  twisting  of  biological 
macromolecules  can  easily  be  excited.  In  fact, 
today  it  has  become  clear  from  both  experiment 
and  large  scale  simulations  [9]  that  the  key  exci¬ 
tations  which  participate  in  DNA  expression  are 
the  bending  and  twisting  distortions,  which  allow 
the  molecule  to  accomodate  attachment  of  a 
suitable  enzyme,  that  in  turn  either  locally  or  in 
coordination  with  other  parts  of  the  molecule 
provides  sufficient  chemical  energy  to  open  the 
coded  base  pair  sequence. 

What  then  is  the  current  thinking  on  the  dy¬ 
namical  degrees  of  freedom  of  a  DNA  macro¬ 
molecule?  Reference  to  the  book  by  McCammon 
and  Harvey  [6]  is  a  good  place  to  start,  but  to 
provide  some  image  at  least  of  what  is  involved, 
in  fig.  1  is  shown  a  section  of  B-DNA,  together 
with  a  schematic  of  the  significant  torsional  de¬ 
grees  of  freedom  along  a  representative  section 
of  one  strand.  The  torsional  i.e.  dihedral  mo¬ 
tions  cannot  be  represented  by  pair  potentials, 
since  for  purely  geometric  reasons  it  takes  4- 
atom  positions  to  define  twisting  about  one 
bond.  The  main  components  are:  a  phosphate 
unit  (PO4)  of  which  the  P  and  two  O  are  part  of 
the  helical  ladder,  a  5-membered  ribose  sugar 
ring  (ribose;  with  or  without  an  attached  oxygen 
i.e.  oxy  or  deoxy)  one  of  whose  edges  is  part  of 
the  helix  while  opposite  corner  ties  to  the  base 
which  in  turn  connects  to  the  base  on  the  com¬ 
plementary  helix.  The  main  degrees  of  freedom 
(per  unit)  are  bending  of  PO  bonds,  twist  of  the 
ribose  ring,  and  thereby  twist  of  the  attached 
base.  These  perform  the  dominant  motions;  base 
pair  opening  is  not  likely  to  take  place  as  a  result 
of  the  intrinsic  dynamics. 

It  is  far  beyond  the  scope  of  this  short  report 
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Fig.  1.  lUPAC  nomenclature  for  backbone  torsion  angels,  ribose  endocyclic  torsions,  and  the  glycosidic  torsion.  The  backbone 
torsion  8  is  defined  by  the  four  atoms  03'-C3'-C4'-C5',  while  the  endocyclic  ribose  torsion  about  the  same  bond,  8,,  is  defined 
by  C2'-C3'-C4'-04'.  The  proper  definition  of  x  04'-Cr-Nl-C2  for  pyrimidines  and  C)4'-Cr-N9’-C4  for  purines. 


to  discourse  on  the  many  fascinating  types  of 
dynamical  and  static  distortions,  general  con¬ 
formations,  that  this  beautiful  molecular  system 
can  sustain.  To  data,  almost  all  of  what  we 
known  comes  from  “brute  force”  massive  simu¬ 
lations^'.  Are  there  possibly  generic  nonlinear 
modes?  The  answer  I  would  guess  would  be  a 

*'See  (10),  this  reference  provides  not  only  detailed  results 
of  large,  realistic  simulations,  but  an  extensive  bibliography 
as  well. 


cautious  affirmative;  the  plane  of  the  5- 
membered  sugar  ring  is  puckered  and  seems  to 
have  two  possible  minimum  energy  configura¬ 
tions,  describable  by  a  2-well  potential  function. 
These,  coupled  by  the  phosphate  ions  along  the 
backbones  might  give  solitary  kinked  bending 
modes  [11].  The  most  difficult  challenge  facing 
nonlinear  science  at  this  time  in  biomolecular 
dynamics  is  how,  if  indeed  it  is  possible,  to 
model  the  significant  biological  excitations  with  a 
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few  important  modes  of  excitation*^.  The  time  is 
near  when  we  may  be  able  to  make  progress 
toward  this  goal,  as  long  as  we  now  learn  enough 
of  the  up  the  date  molecular  biology  literature  to 
take  a  quantum  jump  toward  biologically  realis¬ 
tic  current  nonlinear  models. 
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There  is  a  growing  feeling  that  biomolecular  structure  is  not  sufficient  to  determine  biological  activity  which  is  also 
governed  by  large  amplitude  dynamics  of  the  molecules.  The  transcription  of  DMA  or  its  thermal  denaturation  are  typical 
examples.  Traditional  approaches  use  Ising  models  to  describe  the  denaturation  transition  of  DNA.  They  have  to  introduc-' 
phenomenological  "cooperativity  factors”  to  explain  the  rather  sharp  “melting”  of  this  quasi  one-dimensional  system, 
present  models  which  describe  the  full  dynamics  of  the  melting.  Using  molecular  dynamics  simulations  and  statistical 
analysis,  we  discuss  the  mechanism  of  the  denaturation,  including  precursor  effects  that  can  be  related  to  large  amplitude 
localized  nonlinear  excitations  of  the  molecule  in  which  discreteness  effects  play  a  large  role.  We  also  show  the  microscopic 
origin  of  the  cooperativity  factors. 


1.  Introduction 

Nucleic  acids  are  the  repository  of  genetic 
information  and  each  of  the  units  that  compose 
DNA  or  RNA  molecules  plays  an  essential  role 
in  the  biological  functions.  The  famous  discovery 
of  the  double  helix  has  emphasized  a  strong 
relationship  between  structure  and  function  in 
molecular  biology.  However  this  structure, 
which  is  so  well  designed  to  include  the  genetic 
code  in  two  complementary  strands  and  protects 
it  against  external  perturbations,  would  also  pre¬ 
vent  the  expression  of  the  code  if  the  molecule 
were  static  because  the  coding  bases  are  not 
directly  accessible  to  chemical  reaction.  There 
are  however  many  indications  that  DNA  is  a 
very  dynamical  entity,  undergoing  very  large  de¬ 
formations  and  should  not  be  viewed  merely  as  a 
solid  with  a  particular  structure. 

A  typical  example  in  which  the  dynamics  of 
the  molecule  is  essential  for  its  function  is  DNA 
transcription  during  which  a  segment  of  the  gen¬ 
etic  code  is  copied  into  RNA.  In  order  to  expose 


the  coding  bases  to  chemical  reaction,  the  double 
helix  unwinds  locally  and  forms  a  “bubble” 
which  is  about  20  base-pair  long  and  moves 
along  the  molecule  as  the  transcription  proceeds. 
This  complex  process,  which  is  activated  by  an 
enzyme,  is  still  beyond  a  physical  analysis  but  it 
has  strong  similarities  with  the  early  stage  of  the 
termal  denaturation,  or  “melting”  of  the  double 
helix.  The  melting,  which  is  the  separation  of  the 
two  complementary  strands,  starts  locally  by  the 
formation  of  small  denaturated  regions  very 
similar  to  the  transcription  bubble.  Another  im¬ 
portant  motion  of  the  DNA  molecule  is  its 
“breathing”  or  fluctuational  opening.  In  these 
very  large  fluctuations,  base-pairs  are  temporari¬ 
ly  broken  and  the  two  bases  are  exposed  for 
chemical  reaction  for  a  very  short  time  (10"’ s). 
These  fluctuational  openings  can  be  considered 
as  intrinsic  precursors  for  the  denaturation  and 
they  could  play  a  i  .c  in  carcinogenesis  by  exter¬ 
nal  molecules  [1]. 

The  molecular  deformation  involved  in  melt¬ 
ing  or  in  fluctuational  openings  are  so  large  that 
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they  cannot  be  described  by  linear  approxi¬ 
mations.  Therefore  biomolecular  dynamics  is  a 
fascinating  topic  for  nonlinear  science  because  it 
is  related  to  basic  phenomena  of  life  and  we 
know  that  it  has  to  be  fundamentally  nonlinear. 
We  discuss  here  some  aspects  of  DNA  dynamics 
and  we  show  that  simple  nonlinear  models  can 
provide  a  good  description  of  the  large  am¬ 
plitude  distortions  of  the  molecule  which  are 
observed  experimentally.  Our  basic  approach 
can  be  viewed  as  an  extension  of  the  Ising  mod¬ 
els,  which  have  been  widely  used  to  study  the 
statistical  mechanics  of  the  melting,  in  which  we 
treat  completely  the  dynamics  of  the  bases.  In 
section  2  we  present  our  basic  model  and  its 
statistical  mechanics.  This  section  makes  the  con¬ 
nection  with  the  usual  Ising  models  for  DNA. 
Section  3  studies  the  dynamics  of  the  model. 
Molecular  dynamics  is  used  to  detect  the  main 
types  of  large  amplitude  motions  and  connect 
them  with  the  experimental  observations.  Then 
we  propose  two  analytical  investigations  adapted 
to  the  description  of  the  fluctuational  openings 
and  denaturation  bubbles.  Section  4  discusses 
more  precisely  the  thermodynamics  of  the  melt¬ 
ing  of  this  one  dimensional  object,  the  DNA 
molecule.  We  show  how  the  introduction  of  non¬ 
linear  coupling  terms  to  describe  the  base  stack¬ 
ing  interactions  is  essential  to  explain  the  sharp 
melting  transition  observed  experimentally. 

2.  A  simple  model  for  DNA  melting  and  its 
statistical  mechanics 

The  simplest  description  of  DNA  melting  rep¬ 
resents  a  base-pair  by  an  Ising-like  variable 
which  takes  only  two  values,  0  and  1,  i.e.,  closed 
and  open.  The  denaturation  transition  is  then 
analyzed  by  treating  the  statistical  mechanics  of 
this  one-dimensional  Ising-spin  chain  [2].  The 
structure  of  DNA  appears  in  the  calculation  of 
the  statistical  weight  of  each  state  of  the  mole¬ 
cule  which  is  expressed  as  the  product  of  a 
stability  parameter  for  each  base-pair,  a 


cooperativity  parameter  taking  into  account  the 
fact  that  breaking  a  base-pair  destroys  two  stack¬ 
ing  interactions  unless  the  pair  is  the  terminal 
pair  of  an  open  region,  and  an  entropy  parame¬ 
ter  for  each  loop  measuring  the  “stiffness”  of  the 
DNA  strands.  Many  varieties  of  these  Ising- 
models  have  been  presented.  For  instance  the 
stability  parameter  can  be  assigned  to  base-pair 
doublets  rather  than  to  a  single  base-pair  be¬ 
cause,  as  we  discuss  in  the  last  section,  there  is  a 
strong  relationship  between  the  stacking  inter¬ 
action  of  adjacent  bases  and  the  stability  of  the 
pairs.  The  model  parameters  are  determined 
phenomenologically  in  order  to  get  the  best  pos¬ 
sible  agreement  between  the  theoretical  predic¬ 
tions  and  the  experimental  melting  curves.  Once 
a  particular  model  has  been  calibrated  it  may  be 
used  to  predict  the  melting  curve  of  another 
DNA  segment.  The  success  of  this  approach  to 
reproduce  experimental  melting  curves  is  impres¬ 
sive,  but  it  involves  a  large  number  of  adjustable 
parameters.  For  instance.  10  parameters  are 
used  to  represent  the  10  possible  types  of  base- 
pair  doublets  along  the  molecule.  Moreover, 
using  an  Ising-variable  prevents  any  attempt  to 
describe  the  dynamics  of  the  fluctuational  open¬ 
ings  since  states  intermediate  between  closed  and 
open  cannot  be  represented. 

Our  approach  goes  further  but  still  keeps  the 
model  as  simple  as  possible  in  an  attempt  to 
determine  the  fundamental  mechanism  of  the 
melting.  Therefore  we  consider  a  simplified 
geometry  for  the  DNA  chain  in  which  we  have 
neglected  the  asymmetry  of  the  molecule  and  we 
represent  each  strand  by  a  set  of  point  masses 
that  correspond  to  the  nucleotides.  The  charac¬ 
teristics  of  the  model  are  the  following; 

(i)  The  longitudinal  displacements  are  not  con¬ 
sidered  because  their  typical  amplitudes  are  sig¬ 
nificantly  smaller  than  the  amplitudes  of  the 
smaller  transverse  ones  [3].  The  stretching  of  a 
base-pair  in  the  transverse  direction  is  repre¬ 
sented  by  a  real  variable  y„  which  can  therefore 
describe  all  the  states  of  the  pair  from  closed 
(y„  =  0)  to  completely  broken. 
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(ii)  Two  neighboring  nucleotides  of  the  same 
strands  are  connected  by  an  harmonic  potential 
to  keep  the  model  as  simple  as  possible.  On  the 
other  hand,  the  bonds  connecting  the  two  bases 
belonging  to  different  strands  are  extremely 
stretched  when  the  double  helix  opens  locally  so 
that  their  nonlinearity  must  not  be  ignored.  We 
use  a  Morse  potential  to  represent  the  transverse 
interaction  of  the  bases  in  a  pair.  It  describes  not 
only  the  hydrogen  bonds  but  the  repulsive  inter¬ 
actions  of  the  phosphate  groups,  partly  screened 
by  the  surrounding  solvent  as  well.  The  Hamilto¬ 
nian  of  the  model  is  then  the  following: 

n 

+  D(e-''''--l)^J.  (1) 


Since  we  are  interested  in  the  thermal  denatu- 
ration  transition  of  the  molecule,  the  natural 
approach  is  to  investigate  the  statistical  mech¬ 
anics  of  the  model.  Due  to  the  one-dimensional 
character  of  the  system,  and  because  the  interac¬ 
tions  are  restricted  to  nearest  neighbor  interac¬ 
tions,  it  can  be  treated  exactly,  including  fully 
the  nonlinearities,  with  the  transfer  operator 
method  [4]. 

For  a  chain  containing  N  units  with  nearest 
neighbor  coupling,  the  classical  partition  func¬ 
tion,  given  in  terms  of  the  Hamiltonian  (1),  can 
be  expressed  as 


+  30  ^ 
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where  fiy„,  y„-i)  is  the  potential  part  of  the 
Hamiltonian.  The  momentum  part  are  readily 
integrated  to  give  the  usual  kinetic  factor  for  N 
particles  2fp  =  The  potential  part 

can  be  evaluated  exactly  [5-7]  in  the  thermo¬ 
dynamic  limit  of  a  large  system  using 


the  eigenfunctions  and  eigenvalues  of  the  trans¬ 
fer  integral  operator 

(3) 

The  calculation  is  similar  to  the  one  performed 
by  Krumhansl  and  Schrieffer  [6]  for  the  statisti¬ 
cal  mechanics  of  the  <f>*  field.  It  yields  = 
expi-Nfi^o),  where  is  the  lowest  eigenvalue  of 
the  operator.  We  can  then  compute  the  free 
energy  of  the  model  as  -k^T\n2t  = 
~(iNk^T)ln(2Ttmk^T)  +  and  the  specific 
heat  C^  =  -T{d^^ldT^).  The  quantity  which 
gives  a  measure  of  the  extent  of  the  denaturation 
of  the  molecule  is  the  mean  stretching  (y^)  of 
the  hydrogen  bonds,  which  can  also  be  calcu¬ 
lated  with  the  transfer  integral  method  [5]  and 
yields 


(y)  =  (:vm>  =  4? - — 

=  <<^o(y)|yko(>')> 

=  /  ‘^»o(y)ydy,  (4) 

since  in  the  limit  of  large  N  tiie  result  is  again 
dominated  by  the  lowest  eigenvalue  €„  associated 
with  the  normalized  eigenfunction 
In  the  continuum  limit  approximation,  the  TI 
eigenvalue  problem  can  be  solved  exactly,  but 
experiments  on  proton  exchange  in  DNA  [8] 
show  some  evidence  of  exchange  limited  to  a 
single  base  pair  which  suggests  that  discreteness 
effects  can  be  extremely  large  in  DNA.  There¬ 
fore  we  have  solved  numerically  the  eigenvalue 
equation  of  the  transfer  operator  [9]  without 
approximations.  The  TI  operator  is  symmetrized 
and  the  integral  is  replaced  by  sums  of  discrete 
increments,  using  summation  formulas  at  differ¬ 
ent  orders.  The  problem  is  then  equivalent  to 
finding  the  eigenvalues  and  eigenvectors  of  a 
symmetric  matrix. 
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Fig.  1.  Variation  of  (y)  versus  temperature:  the  dash  line 
corresponds  to  the  TI  results  in  the  continuum  limit,  the  solid 
line  gives  the  exact  TI  results  obtained  by  numerical  solution 
of  the  TI  operator,  and  the  plus  signs  correspond  to  molecu¬ 
lar  dynamics  simulations. 


Figure  1  compares  the  thermal  evolution  of 
{>'„)  obtained  with  the  continuum  approxi¬ 
mation  and  the  exact  numerical  calculation,  for 
the  model  parameters  discussed  in  the  next  sec¬ 
tion.  Both  methods  show  a  divergence  of  the 
hydrogen  bond  stretching  over  a  given  tempera¬ 
ture,  but  the  melting  temperature  given  by  the 
numerical  treatment  is  significantly  higher,  point¬ 
ing  out  the  large  role  of  discreteness  in  DNA 
dynamics  if  one  uses  realistic  parameters  for  the 
model.  The  TI  calculation  shows  that  the  specific 
heat  has  a  broad  maximum  around  the  denatura- 
tion  temperature. 


3.  Dynamics  of  the  DNA  molecule 

The  thermodynamics  of  our  DNA  model 
shows  that  it  exhibits  a  thermal  evolution  that  is 
qualitatively  similar  to  the  denaturation  of  the 
molecule  observed  experimentally.  But  this 
statistical  approach  does  not  give  information  on 
the  mechanism  of  the  denaturation,  and  in  par¬ 


ticular,  does  it  start  locally  by  the  formation  of 
denaturation  bubbles  in  agreement  with  the  ex¬ 
periments.  In  order  to  study  this  aspect,  we  have 
investigated  the  dynamics  of  the  model  in  con¬ 
tact  with  a  thermal  bath  by  molecular  dynamics 
simulation  with  the  Nose  method  [10-11]. 

Beginning  with  the  Hamiltonian  H  and  the 
2iV-dimensionai  phase  space  of  a  chain  of  N 
base-pairs  with  periodic  boundary  conditions, 
the  fixed  temperature  canonical  ensemble  can  be 
simulated  by  the  addition  of  a  single  variable  s, 
which  regulates  the  energy  flows,  and  an  addi¬ 
tional  parameter  M,  which  fixes  the  scale  of  the 
temperature  fluctuations.  Nose  demonstrated 
that  in  this  phase  space  of  the  extended  Hamilto¬ 
nian  H\  the  microcanonical  ensemble  of  H'  is 
precisely  the  canonical  ensemble  of  H  at  tem¬ 
perature  T.  This  property  is  only  exact  for 
equilibrium  properties,  but  investigations  cur¬ 
rently  in  progress  [12]  show  that,  provided  that 
the  characteristic  time  of  the  Nose  thermostat, 
controlled  by  M,  is  properly  chosen,  it  can  also 
give  reliable  results  for  the  dynamical  properties. 
Most  of  the  simulations  have  been  performed 
with  a  chain  of  256  base-pairs  with  periodic 
boundary  conditions,  but  in  order  to  achieve 
better  statistics,  some  simulations  have  been  per¬ 
formed  on  a  Connection  Machine-200  with  16384 
base-pairs.  Equations  are  integrated  with  a  4th 
order  Runge-Kutta  scheme  with  a  time-step 
chosen  to  conserve  //'  to  an  accuracy  better  than 
0.001%  during  a  run. 

We  have  chosen  a  system  of  units  adapted  to 
the  energy  and  time  scales  of  the  problem.  Ener¬ 
gies  are  expressed  in  eV,  masses  in  atomic  mass 
unit  (a.m.u.)  and  length  in  A.  The  resulting  time 
unit  is  1  t.u.  =  1.0214  X  lO''"*  s.  The  choice  of 
appropriate  model  parameters  is  a  very  con¬ 
troversial  topic,  as  attested  by  the  debate  in  the 
literature  [13].  There  are  well  established  force 
fields  for  molecular  dynamics  of  biological  mole¬ 
cules,  but  they  have  been  designed  to  provide  a 
good  description  of  the  small  amplitude  motions 
of  the  molecule  and  are  not  reliable  for  the  very 
large  amplitude  motions  involved  in  the  denatu¬ 
ration.  In  our  model,  the  Morse 
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potential  is  an  effective  potential  which  links  the 
two  strands.  It  results  from  a  combination  of  an 
attractive  part  due  to  the  hydrogen  bonds  be¬ 
tween  two  bases  in  a  pair  and  the  repulsive 
interaction  between  the  charged  phosphate 
groups  on  the  two  strands.  The  potential  for  the 
hydrogen  bonds  can  be  rather  well  estimated  [14] 
but  the  repulsive  part  is  harder  to  determine 
because  the  repulsion  is  partly  screened  by  ions 
of  the  solvent.  Consequently  we  had  to  rely  on 
estimations.  The  parameters  that  we  use  have 
been  chosen  to  give  realistic  properties  for  the 
model  in  terms  of  vibrational  frequencies,  size  of 
the  open  regions,  etc.,  but  future  work  will  be 
needed  to  confirm  our  choice.  We  do  not  expect 
however  that  a  better  choice  would  change  quali¬ 
tatively  the  results  presented  here.  The  parame¬ 
ters  that  we  have  chosen  are;  a  dissociation 
energy  D  =  0.04  eV,  a  spatial  scale  factor  of  the 
Morse  potential  a  =  4.45  A a  coupling  con¬ 
stant  K  =  0.06  eV/ A,  a  mass  m  =  300  a.m.u.  The 
constant  of  the  Nose  thermostat  has  been  set  to 
Af  =  1000. 

A  first  scan  of  the  dynamics  of  the  model  is 
obtained  by  imposing  a  slow  temperature  ramp 
(200-540  K)  that  generates  sets  of  states  which 
are  approximately  equilibrated  and  are  used  then 
as  initial  states  for  simulations  at  constant  tem¬ 
perature.  Figure  2  shows  a  time  evolution  of  the 
dynamics  of  the  model  at  three  temperatures. 
The  stretching  of  the  base-pairs  is  indicated  by  a 
grey  scale,  darker  dots  corresponding  to  larger 
stretching.  Looking  at  this  figure,  one  notices 
immediately  two  major  features.  First,  as  one 
moves  along  an  horizontal  direction,  i.e.,  along 
the  molecule  for  a  given  time,  the  amplitude  of 
the  stretching  varies  very  much  from  site  to  site. 
This  is  especially  true  at  high  temperature,  but  it 
is  still  noticeable  at  150  K,  well  below  the  melt¬ 
ing  temperature.  This  shows  that  there  is  no 
equipartition  of  energy  in  this  nonlinear  system, 
but  on  the  contrary  a  tendency  for  the  energy  to 
localize  at  some  points  which  is  more  and  more 
pronounced  as  temperature  increases.  At  high 
temperature,  the  figure  shows  large  black  regions 
which  correspond  to  denaturated  regions  of  the 


molecule.  These  black  areas  are  the  denaturation 
bubbles  observed  experimentally.  At  the  highest 
tem{>erature  shown  here  (fig.  2c)  they  extend 
over  20  to  50  base  pairs  and  their  boundaries  are 
sharp. 

If  the  temperature  is  raised  slightly  above 
540  K,  the  bubbles  grow  even  more  and  finally 
extend  over  the  whole  chain:  the  molecule  is 
completely  denaturated.  The  second  remarkable 
feature  on  fig.  2  can  be  observed  by  moving 
along  a  vertical  line  on  the  figure,  i.e.,  following 
the  time  evolution  of  a  given  base  pair.  If  one 
choses  one  region  of  the  molecule  in  which  the 
energy  is  concentrated,  one  can  see  alternating 
black  and  light-grey  dots.  This  is  due  to  an 
internal  breathing  of  the  localized  excitations  that 
oscillate  between  a  large  amplitude  (black  dots 
in  the  figure)  and  a  small  amplitude  state  (light 
dots)  in  a  regular  manner.  These  motions  are  the 
fluctuational  openings  of  DNA.  They  exist  even 
well  below  the  denaturation  temperature  and 
coexist  with  denaturated  bubbles  in  the  high 
temperature  range.  Figure  2b  shows  that  they 
play  the  role  of  precursor  motions  for  the  forma¬ 
tion  of  the  bubbles. 

The  calculation  of  the  dynamical  structure  fac¬ 
tor  from  the  molecular  dynamics  results  exhibits 
two  types  of  excitations.  In  the  high  frequency 
range,  one  recognizes  the  phonon  modes  corre¬ 
sponding  to  linear  motions  of  the  chain.  At  low 
temperature  their  dispersion  curve  is  well  de¬ 
scribed  by  the  linear  dispersion  curve  resulting 
from  the  equations  of  motions  of  the  model. 
Close  to  melting,  on  the  contrary,  most  of  the 
chain  is  on  the  plateau  of  the  Morse  potential 
and  therefore  experiences  almost  no  restoring 
force  that  brings  it  back  to  y  =  0.  The  dispersion 
curve  is  then  the  dispersion  relation  of  a  chain  of 
harmonically  coupled  particles,  without  a  sub¬ 
strate  potential,  i.e.,  a  dispersion  relation  with¬ 
out  gap.  The  variation  versus  temperature  of  the 
frequency  of  main  phonon  peak  at  wavevector 
q  =  jTi  plotted  on  fig.  3  shows  clearly  the  transi¬ 
tion  between  a  frequency  belonging  to  the  origi¬ 
nal  dispersion  curve  at  low  T  toward  that  of  a 
gapless  dispersion  curve.  This  phonon  softening 


should  be  observable  experimentally  in  the  vic¬ 
inity  of  DNA  melting  transition.  The  second 
characteristic  feature  of  the  dynamical  structure 
factor  is  a  low  frequency  peak,  associated  to  the 
fluctuational  opening,  which  shifts  to  zero  fre¬ 


quency  as  the  denaturation  bubbles  form  near 
the  melting  point. 

Since  the  molecular  dynamics  simulations  have 
found  the  two  types  of  motions  observed  ex¬ 
perimentally.  fluctuational  openings  and  bub- 
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Fig.  3.  Frequency  of  the  phonon  modes  versus  temperature 
for  q  =  Itr.  The  error  bars  indicate  an  interval  where  the 
frequency  lies,  at  those  temperatures  for  which  a  single  mode 
cannot  be  identified.  The  horizontal  lines  correspond  to  the 
phonon  frequency  at  the  bottom  of  the  well  (dashed)  and  on 
the  Morse  plateau  (dash-dot-dot-dot). 

bles,  it  is  interesting  to  see  whether  an  analysis  in 
terms  of  nonlinear  excitations  can  explain  these 
motions. 

The  fluctuational  openings  correspond  to  large 
amplitude  breathing  modes  which  are  localized 
by  nonlinear  effects.  As  they  extend  only  over  a 
few  base  pairs,  they  are  intrinsically  discrete. 
Their  existence  and  long  term  stability  poses  the 
general  question  of  the  existence  of  breathers  in 
discrete  Klein-Gordon  models  which  has  already 
attracted  a  great  deal  of  attention  without  receiv¬ 
ing  a  definite  answer  [15].  An  analytical  investi¬ 
gation  of  the  DNA  model  presented  above  is 
difficult  due  to  the  Morse  potential,  but,  since 
the  fluctuational  openings  are  intermediate  am¬ 
plitude  oscillations,  their  study  can  be  performed 
with  a  simpler  potential  that  has  qualitatively  the 
shape  of  the  Morse  potential  for  small  and  inter¬ 
mediate  amplitude  but  is  more  suitable  analyti¬ 
cally.  The  potential 

V{y„)  =  D{\yl-a-\yl),  (5) 


for  >'„  s  1/a  is  convenient.  Introducing  the  vari¬ 
able  -  ay„  and  the  scaled  time  t  =  VKIm  t, 
and  defining  the  parameter  <al  =  D/K,  which 
measures  the  discreteness,  the  Hamiltonian  of 
this  simplified  models  becomes 

^  S[2W'^+ 

a  n 

+  ^40^-  (6) 

Large  amplitude  localized  breathers  can  be  ob¬ 
tained  for  this  model  [16]  by  the  lattice-Green 
functions  method  [17]  developed  recently  to 
study  local  modes  in  nonlinear  lattices.  We  seek 
long-lived  oscillatory  solutions  under  the  form 

oc 

W'„  =  S  <l>‘„  cos(ia>bO  (7) 

i  =  0 

where  tui,  is  the  eigenfrequency  of  the  fundamen¬ 
tal  mode  (i  =  1)  and  the  time  independent 
amplitude  of  the  ith  mode  (the  ansatz  has  a  dc 
part  simply  because  of  the  asymmetry  of  the 
potential).  Inserting  this  ansatz  in  the  equations 
of  motion  and  setting  the  coefficients  of  cos(i(«>bO 
equal  to  one  another  and  retaining  only  the  first 
three  terms,  which  gives  already  a  good  appioxi- 
mation,  we  obtain: 

=  ] .  (8) 

(a»d  -  wl)<f>'„  -  -I- 

=  "^(20°  +  ,  (9) 

("d  “  +  4>l-i  -  2<f>l) 

=  <01(24,14,1  + (10) 

Using  the  lattice  Green’s  function  of  the  linear 
part  of  the  above  equations, 

1  ^  g<<?(n-'n) 

G(n  -  m,  -  Z  ^7"; - (H) 

/V  ,  (o  (q)-  (o^, 
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where  q  is  the  wave  vector  inside  the  first  Bril- 
fouin  zone,  <t>\  can  be  expressed  in  terms 

of  the  r.h.s.  of  eqs.  (8)-(10),  which  gives  a  set  of 
simultaneous  nonlinear  eigenvalue  equations  de¬ 
termining  the  eigenfrequency  (o^,  and  the  eigen¬ 
functions  They  can  be  solved  numerically  by 
iterations,  starting  with  appropriate  initial  condi¬ 
tions.  We  fix  the  dc  term,  which  amounts  to 
choosing  a  particular  amplitude  for  the  solution. 
Only  15  iterations  are  necessary  to  determine  the 
values  with  an  accuracy  of  10“'*.  A  numerical 
simulation  of  the  dynamics  of  the  chain  with  the 
solution  determined  as  above  as  an  initial  condi¬ 
tion  shows  that  a  large  amplitude  breather  local¬ 
ized  on  very  few  lattice  sites  is  extremely  long 
lived,  in  agreement  with  the  results  of  the  molec¬ 
ular  dynamics  simulations  of  our  DNA  model. 
Consequently  the  very  narrow  fluctuational 
openings  observed  experimentally  in  DNA  [8] 
and  in  our  molecular  dynamics  investigations 
could  well  be  discrete  breathers  stabilized  by 
nonlinearity. 

The  study  of  the  denaturation  bubbles  turns 
out  to  be  much  more  difficult  because  they  can¬ 
not  be  studied  independently  of  the  thermal 
effects.  Due  to  the  shape  of  the  Morse  potential, 
at  r  =  0,  any  large  amplitude  opening  bringing  a 
set  of  neighboring  base-pairs  on  the  plateau  of 
the  Morse  potential  is  unstable.  This  initial  state 
would  oscillate  at  very  low  frequency,  but  cannot 
stay  in  an  open  state  since  the  bases  in  the  open 
region  are  called  back  to  their  closing  state  by 
the  bases  which  are  still  closed  in  the  molecule 
and  by  the  small  downward  slope  of  the  Morse 
plateau.  Therefore  the  existence  of  long  lived 
open  bubbles  in  our  model,  as  they  are  seen  in 
the  molecular  dynamics  simulations,  is  fun¬ 
damentally  a  thermal  effect  which  cannot  be 
studied  by  Hamiltonian  mechanics,  as  we  did  for 
the  fluctuational  openings.  One  way  around  this 
difficulty  is  to  include  temperature  effects  in  the 
potential  itself.  This  is  exactly  what  the  self- 
consistent  phonon  (SCP)  method  does  using  a 
trial  Hamiltonian  for  the  calculation  of  the  free 
energy  of  the  system  at  temperature  T  [18]. 


Introducing  u„  =  y„  -  (y)  =  y„-q  and  two  pa¬ 
rameters  and  <^,  we  apply  the  SCP  method 
[18]  by  considering  the  trial  harmonic  Hamil¬ 
tonian 

=  2  [imul  +  i<f>{u„  ~  u„^^f  +  . 

(12) 

The  canonical  partition  function  S  can  be  ex¬ 
pressed  as  the  product  of  the  unperturbed  parti¬ 
tion  function  and  a  perturbation  factor  i?,; 

3r  =  JridM.e'^’' 

The  perturbation  series  for  f?,  is  calculated  by 
expanding  the  exponential,  and  the  logarithmic 
function;  the  coefficient  of  (-^)7n!  in  the  ex¬ 
pansion  of  In  is  called  the  nth  cumulant  and  is 
written  [19]  <(//-//„)")  o.c-  Thus 

^=-ksTlnS 
=  -k^TlnSo- 

n=l 

=  ^0  -f  ^2  ■*■  ■  ■  ’  • 

The  first  contribution  ‘s  the  contribution  of  N 
harmonic  oscillators, 

3^o=-^^BT2ln?^.  (13) 

,=o  <o  iq) 

It  can  be  shown  [20]  that  the  variational  free 
energy  ^  gives  an  upper  bound 

for  the  actual  free  energy.  The  only  difficulty  in 
evaluating  is  the  self-consistent  substrate 
potential.  It  can  be  evaluated  and  gives  an  “ef¬ 
fective”  potential  that  keeps  the  shape  of  the 
Morse  potential,  but  with  a  minimum  which  is 
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temperature  dependent.  The  expression  for  the 
first  order  correction  for  the  free  energy  is  then: 


9,  =  {K-,t>){{u-)-{v-))-\n-W) 

(14) 


with  («•)  =  {ur„)  and  {v-)  =  {u„u„^t).  Consid¬ 
ering  rj,  (m“)  and  (v~)  as  variational  parame¬ 
ters,  the  conditions  for  to  be  stationary  give 
Tj  =  la{u~),  <f>  =  K  and  =  2a'D  exp(-  la-q). 
As  T)  is  an  increasing  function  of  temperature, 
o’  decreases  with  T,  which  corresponds  to  the 
mode  softening  observed  in  the  molecular  dy¬ 
namics  simulations.  Since  (u^)  and  17  are  re¬ 
lated,  the  minimization  of  amounts  simply  to 
solving  the  equation 


77  = 


2akgT 

2N 


E  l2fl^D  e 


24TJ  ‘  3 


+  AK  sin’('rrp/N)J''  . 


(15) 


In  practice,  eq.  (15)  is  solved  by  a  simple 
bisection  method  and  fig.  4  shows  the  free  ener¬ 
gy  versus  77  at  different  temperatures.  We  see 


Fig.  4.  Free  energy  d*',  versus  tj  at  different  temperatures: 
r  =  350  (dashed),  380  (dash-dotted),  411  (solid).  430  (dash- 
dot-dot-dot)  K. 


clearly  that  the  self-consistent  solution,  which 
corresponds  to  the  metastable  minimum  of  the 
dashed  and  dash-dotted  curves,  disappears  at 
7^,  =  411  K  to  give  a  strictly  decreasing  function 
of  77.  Over  the  only  minimum  is  obtained  for 
an  infinite  value  of  77,  which  does  not  correspond 
to  a  self-consistent  solution  of  the  problem.  Con¬ 
sequently  can  be  identified  as  the  denatura- 
tion  temperature  given  by  the  SCP.  Moreover, 
assuming  that  the  free  energy  can  play  the  role 
of  an  effective  potential  at  temperature  T,  one 
can  see  that,  as  T  approaches  the  melting  tem¬ 
perature,  the  free  energy  exhibits  a  maximum 
followed  by  a  decreasing  part.  In  a  “mechanical" 
equation  of  motion,  such  a  potential  would  give 
stable  open  bubbles. 


4.  Is  DNA  melting  a  one-dimensional  phase 
transition? 

The  model  discussed  above  has  been  able  to 
describe  some  of  the  main  features  of  DNA 
melting  as  it  is  observed  ex(>erimentally.  How¬ 
ever  there  is  a  crucial  point  in  which  this  model 
gives  incorrect  results,  it  is  the  sharpness  of  the 
phase  transition.  For  an  homopolymer,  the  ex¬ 
periments  show  that  the  melting  occurs  very 
abruptly  over  a  temperature  interval  which  is 
only  a  few  K  or  even  less.  This  poses  a  very 
fundamental  question  since  DNA  is  basically  a 
one-dimensional  system,  which  is  not  expected 
to  have  a  phase  transition.  We  would  like  to 
conclude  our  paper  by  showing  that,  within  a 
one-dimensional  model  with  short  range  interac¬ 
tions,  a  sharp  transition  is  possible  if  one  takes 
into  account  properly  the  nonlinearity  of  the 
base  stacking  interaction  [21 J.  The  possibility  of 
a  phase  transition  in  one-dimensional  DNA  was 
already  examined  within  the  Ising-model  ap¬ 
proach  by  Poland  and  Scheraga  [22]  and  Azbel 
[23]  who  concluded  that  it  can  be  attributed  to 
cooperativity  effects  and  to  the  role  of  the  wind¬ 
ing  entropy  released  when  the  two  strands  sepa¬ 
rate.  A  simple  extension  of  our  DNA  model  (1) 
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can  describe  the  dynamics  of  these  effects  and 
gives  a  very  sharp  transition  in  agreement  with 
the  experiments. 

The  stacking  energy  between  two  neighboring 
base  pairs  is  described  by  the  anharmonic 
potential: 

x(y„ (16) 

This  new  intersite  coupling,  replacing  the  simple 
harmonic  coupling  of  our  previous  approach,  is 
responsible  for  qualitatively  different  properties. 
The  choice  of  this  potential  has  been  motivated 
by  the  observation  that  the  stacking  energy  is  not 
a  property  of  individual  bases,  but  a  character  of 
the  base  pairs  themselves  [24].  When  the  hydro¬ 
gen  bonds  connecting  the  bases  break,  the  elec¬ 
tronic  distribution  on  bases  are  modified,  causing 
the  stacking  interaction  with  adjacent  bases  to 
decrease.  In  eq.  (16),  this  effect  is  enforced  by 
the  prefactor  of  the  usual  quadratic  term  ( y„  - 
y„  _ ,  )^  This  prefactor  depend*^  on  the  sum  of  the 
stretchings  of  the  two  interacting  base-pairs  and 
decreases  from  5  AC(  1  -l-  p)  to  j  IC  when  either  one 
(or  both)  base-pair  is  stretched.  Although  its 
form  was  chosen  for  analytical  convenience,  the 
qualitative  features  of  potential  (16)  are  in 
agreement  with  the  properties  of  chemical  bonds 
in  DNA.  They  also  provide  the  cooperativity 
effects  that  were  introduced  phenomenologically 
in  the  Ising  models.  A  base  pair  that  is  in  the 
vicinity  of  an  open  site  has  lower  vibrational 
frequencies,  which  reduces  its  contribution  to  the 
free  energy.  Simultaneously  a  lower  coupling 
along  the  strands  gives  the  bases  more  freedom 
to  move  independently  from  each  other,  causing 
an  entropy  increase  which  drives  a  sharp  transi¬ 
tion.  Our  approach  can  be  compared  to  recent 
views  on  structural  phase  transition  in  elastic 
media  which  stress  that  intrinsic  nonlinear  fea¬ 
tures  characterize  the  physics  of  these  trans¬ 
formations,  and  extend  the  standard  soft  mode 
picture  [25,26].  It  is  important  to  notice  that. 


although  cooperativity  in  introduced  through 
purely  nearest  neighbor  coupling  terms,  it  has  a 
remarkable  effect  on  the  ID  transition. 

Figure  5  shows  the  drastic  change  introduced 
by  the  anharmonic  coupling  on  the  specific  heat 
of  the  model  calculated  by  the  transfer  integral 
method  [21].  The  full  curve  corresponding  to  the 
anharmonic  stacking  interaction  shows  a  sharp 
peak  very  similar  to  that  one  would  expect  from 
a  first-order  phase  transition,  whereas  the  har¬ 
monic  coupling  investigated  before  gives  only  a 
smooth  maximum.  This  result  suggests  that,  al¬ 
though  DNA  structure  is  very  complicated,  a 
simple  nonlinear  model  is  able  to  reproduce  with 
a  good  agreement  the  main  experimental  fea¬ 
tures  of  its  dynamics  for  the  fluctuational  open¬ 
ings  as  well  as  the  melting  curves.  Indeed  such  a 
model  does  not  pretend  to  give  exact  quantita¬ 
tive  results  that  would  fit  exactly  the  experimen¬ 
tal  melting  curve  of  an  heteropolymer  with  all  its 
small  structure.  Our  feeling  is  that  it  is  more 
important  to  get  a  basic  understanding  than  to 


Fig.  .5.  Variation  of  the  specific  heat  versus  temperature.  The 
very  narrow  peak  corresponds  to  the  anharmonic  coupling 
case  (a  =  0.35,  p  =  0.5),  the  dotted  curve  and  the  solid  broad 
peak  to  harmonic  coupling  {k'  =  l.5k  and  k''=k.  respec¬ 
tively). 
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try  to  explain  the  very  small  experimental  result. 
This  does  not  mean  that  we  think  that  the  simple 
model  that  we  discussed  here  is  complete  and 
should  not  be  extended.  But  any  extension  will 
have  to  be  measured  for  the  new  fundamental 
feature  it  brings  against  the  unnecessary  compli¬ 
cations  it  introduces.  Finally  we  would  like  to 
point  out  that,  although  we  have  discussed  non¬ 
linear  dynamics  of  DNA,  we  have  not  intro¬ 
duced  solitons  in  our  picture  (although  the  dis¬ 
crete  breathers  are  probably  close  to  being  sol¬ 
itons).  It  is  simply  because  they  do  not  seem  to 
be  necessary  to  explain  the  dynamical  features  of 
DNA  we  are  interested  in.  Solitons  are  marvell¬ 
ous  objects  but  nonlinear  science  can  also  live 
without  them. 
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In  this  paper  we  describe  laser  light  interacting  with  nematic  liquid  crystals.  The  paper  begins  with  a  summary  of 
recent  experimental  results  of  E.  Braun,  L.  Faucheux,  and  A.  Libchaber  in  which  the  liquid  crystal  sample  is  studied 
in  three  geometries  -  film,  pipe,  and  droplet.  Then,  after  a  very  brief  glimpse  at  the  history  of  liquid  crystals,  a 
theoretical  model  of  the  interacting  system  is  described.  In  a  one  transverse  dimensional  idealization,  we  investigate 
the  pipe  and  film  configurations.  In  these  cases  the  model  reduces  to  a  coupled  system  of  nonlinear  pde's  -  an 
elliptic  sine-Gordon  equation  for  the  director  field  coupled  to  a  Schroedinger  equation  for  the  electromagnetic  field. 
Properties  and  qualitative  behavior  of  this  coupled  system  are  described,  both  numerically  and  theoretically.  As  an 
illustrative  example  of  boundary  layer  analysis  of  such  coupled  light-nematic  systems,  we  describe  calculations  in  the 
film  geometry  in  some  detail.  Results  of  this  analysis  include:  (i)  an  extension  of  the  Frederiks  bifurcation  analysis 
to  electric  fields  with  spatial  variation;  (ii)  the  determination  of  the  transverse  scale  at  which  self-focusing  saturates 
in  this  nematic;  (iii)  the  derivation  of  a  nonlocal  nonlinear  Schroedinger  equation  which  governs  the  inner  structure 
of  the  laser  beam.  We  conclude  the  paper  with  a  summary  of  similar  boundary  layer  calculations  for  light-nematic 
systems  in  other  geometries. 


1.  Introduction 

Liquid  crystals  provide  an  optical  medium 
with  very  strong  nonlinearity.  For  example,  the 
coefficient  of  nonlinearity  in  liquid  crystals  can 
easily  be  10^-10'°  times  greater  than  in  “typ¬ 
ical”  nonlinear  optical  media  such  as  carbon 
bisulfide  (CS2).  The  extreme  nonlinearity  of 
liquid  crystals  permits  the  investigation  of  non¬ 
linear  optical  effects  such  as  self-focusing  and 
filamentation  with  low  power  cw  lasers.  For  this 
reason  liquid  crystals  are  excellent  materials  for 
the  investigation  of  laser  radiation  interacting 
nonlinearly  with  matter,  and  thus  they  provide 


'  This  lecture  is  dedicated  to  A1  Scott,  who  has  shown  us 
many  beautiful  properties  of  the  sine-Gordon  equation. 
^  Funded  in  part  by  AFOSR-9()-0161  and  by  NSF  DMS 
8922717  AOl. 

^  Funded  in  part  by  AFOSR  F-49620-92-J-  0023F,  NSF 
DMS-9 100383  and  NSF  DMS-9 157492. 


an  Optical  arena  for  the  study  of  the  physics  of 
complex  nonlinear  patterns. 

In  recent  experiments  of  E.  Braun,  A.  Libch¬ 
aber,  and  L.  Faucheux  [  1  ],  nematic  liquid  crys¬ 
tals  (MBBA,  6CB,  and  E209)  were  irradiated 
with  light  from  a  cw  argon  laser  with  1  watt  of 
power.  A  linearly  polarized  argon  laser  beam  in 
the  TEMoo  gaussian  mode  (the  pump  beam) 
was  applied  to  the  sample  of  liquid  crystal  which 
sat  on  the  stage  of  an  inverted  microscope.  The 
pump  beam  had  a  waist  of  ~  50  /im  at  the  bot¬ 
tom  plate  of  the  sample.  Thermal  effects  were 
negligible,  while  scattering  losses  were  more  sub¬ 
stantial  but  controlled. 

A  detailed  description  of  the  experimental 
results  may  be  found  in  [2,1  ].  Here  we  restrict 
ourselves  to  a  few  remarks  about  them,  focusing 
attention  upon  two  representative  experiments. 
Studies  were  performed  in  three  geometries  - 
film,  droplet,  and  capillary  tube.  Self-focusing 
effects  were  observed  in  all  three  geometries.  In 
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Fig.  1.  Self-focusing  and  filamentation  in  a  droplet  config¬ 
uration  of  the  nematic.  Note  the  boundars  of  the  droplet  Fig.  2.  Longitudinal  view  of  self-focused  filaments  in  the 

is  observable.  Also  note  that  polarization  elTects  produce  capillars  tube  configuration  of  the  nematic. 

"ordinarv"  and  "extraordinarv"  rays.  The  latter  coherently 

self-locus.  diameter  ~  1.5  mm,  was  coated  with  the 

polymer  M.AP  to  enforce  tangential  alignment  of 
the  thin  film  case,  the  observations  were  made  the  director  at  the  glass  surface,  and  thus  to  en- 

outside  the  liquid  crystal  “after"  the  light  beam  sure  uniform  bounday  conditions  along  the  cap- 

had  passed  through  the  thin  sample;  thus,  self-  illary  tube.  The  liquid  crystal  was  then  injected 

focusing  was  observed  only  indirectly  .  However.  in  the  nematic  phase.  One  end  of  the  capillary 

in  the  droplet  and  tube  cases,  filamentation  was  was  sealed  with  fast  epoxy  to  avoid  any  motion 

directly  observed.  Figure  I  shows  a  beautiful  of  the  nematic.  Cooling  was  achieved  by  a  flow' 

e.xample  of  focusing  and  filamentation  in  the  of  nitrogene  around  the  capillary  .  The  strongest 

droplet  geometry  .  lens  in  this  system  is  a  diverging  one  due  to  the 

In  order  to  achieve  better  control  of  thermal  meniscus  at  the  air-liquid  cry  stal  interface.  With 

effects.  Braun,  et  al.  (2,1)  changed  to  the  geom-  this  setup,  the  argon  laser  beam  was  focused  in- 

eiry  of  a  cylindrical  capillary  tube,  together  with  side  the  sample,  far  from  the  entrance  meniscus, 

a  different  nematic  liquid  crystal.  E209.  which  and  the  scattered  light  was  observed  from  below, 

offers  a  wide  range  in  temperature  in  the  ne-  Figure  2  shows  a  striking  longitudinal  view 

matic  phase.  A  cy  lindrical  capillary  tube,  of  in-  of  the  light  field  along  the  capillary  tube.  Many 
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Other  observations,  for  several  increasing  values 
of  power  in  the  pump  beam,  can  be  found  in 
[  2, 1 J .  There  one  observes  the  presence  of  a  focal 
spot,  and,  as  the  pump  intensity  is  increased,  its 
“movement”  along  the  tube  toward  the  meniscus 
at  the  front  entry.  As  the  intensity  is  further  in¬ 
creased,  this  movement  stops  and  a  second  spot 
forms  and  moves  toward  the  front  of  the  tube. 
Three  coexisting  focal  spots  have  been  observed 
in  the  experiments  [2,1  ].  Also  one  observes  the 
onset  of  transverse  undulations  of  the  filament. 
The  wavelength  of  these  undulations  at  first  de¬ 
creases,  and  then  increases  with  increasing  pump 
intensity  [1].  Finally,  at  still  higher  values  of 
pump  intensity,  two  filaments  form  and  inter¬ 
act.  As  far  as  we  know,  these  are  the  first  obser¬ 
vation  of  the  actual  longitudinal  behavior  of  fil- 
amentation  of  a  low  power  cw  laser  beam  in  a 
nonlinear  medium.  In  any  case,  it  is  certainly  a 
striking  visualization. 


2.  Brief  overview  of  the  history  of  liquid  crystals 

The  twelfth  International  Liquid  Crystal  Con¬ 
ference  was  held  in  Germany  in  1988.  One  as¬ 
pect  of  that  conference  was  the  celebration  of  the 
one  hundredth  anniversary  of  the  “discovery”  of 
liquid  crystals.  At  that  conference,  H.  Kelker  de¬ 
scribed  that  history.  Our  account  is  taken  from 
the  second  chapter  of  a  recent  book  by  Peter 
Collings  [3]. 

During  the  lime  period  from  1850-1888,  just 
before  the  “discovery”  of  liquid  crystals,  several 
European  scientists  were  observing  striking  vi¬ 
sual  effects  such  as  cloudiness  and  color  changes 
which  would  later  be  attributed  to  the  liquid 
crystal  phase  of  their  samples.  For  the  most  part 
these  scientists  were  biologists  who  were  inves¬ 
tigating  under  microscopes  the  effect  of  temper¬ 
ature  on  different  biological  materials  including 
the  outer  covering  of  nerve  fibers,  natural  fats, 
and  cholesterol.  An  Austrian  botanist,  Friedrich 
Reinitzer,  is  generally  credited  with  the  actual 
discovery  of  liquid  crystals  in  his  studies  of 


cholesterol  in  plants.  In  1 888  he  was  observing 
the  melting  behavior  of  an  organic  substance  re¬ 
lated  to  cholesterol  and  noted  that  it  possessed 
two  melting  points  -  at  145.5°C  it  melted  to  a 
cloudy  liquid  and  at  178.S°C  this  cloudy  liquid 
turned  clear.  He  also  observed  the  appearence 
of  blue  colors  at  the  phase  transitions.  The  sub¬ 
stance  that  Reinitzer  was  observing  was  in  fact 
cholesteryl  benzoate,  a  chiral  nematic  liquid 
crystal. 

Reinitzer  sent  some  of  his  samples  to  a  Ger¬ 
man  physicist,  Otto  Lehmann,  who  had  just  de¬ 
veloped  an  important  microscope.  Lehmann’s 
microscope  was  a  significant  technical  advance¬ 
ment  for  that  time  period  in  that  it  had  a  heat¬ 
ing  stage  that  allowed  him  to  observe  crystal¬ 
lization  as  temperature  was  slowly  lowered,  as 
well  as  polarizers  which  allowed  him  to  view  po¬ 
larization  phenomena.  Through  his  observations 
of  Reinitzer’s  samples,  Lehmann  became  con¬ 
vinced  that  the  cloudy  liquid  was  a  liquid  phase, 
but  a  phase  that  affected  polarized  light  in  a  man¬ 
ner  typical  of  solid  crystals.  The  substance  had 
the  flow  properties  of  a  liquid,  but  the  optical 
properties  of  a  solid,  and  led  Lehmann  to  call 
such  substances  “liquid  crystals”. 

As  described  by  Collings  [3],  European 
chemists,  biologists,  and  physicists  continued  to 
study  liquid  crystals,  both  experimentally  and 
theoretically,  during  the  first  half  of  the  twen¬ 
tieth  century.  The  theory  culminated  with  the 
continuum  elastic  theory  of  English  physicist  F. 
C.  Frank  in  the  1950’s.  Little  was  done  in  liquid 
crystal  research  from  the  end  of  World  War  II 
through  the  early  1960’s,  perhaps  because  no 
important  applications  had  emerged.  The  dis¬ 
covery  of  liquid  crystal  displays  (LCD’s)  at  the 
end  of  the  1 960’s  certainly  changed  the  status  of 
research  in  the  area!  Today  liquid  crystal  physics 
is  alive,  well,  and  dynamic  -  with  a  great  many 
scientists  actively  studying  nonlinear  properties 
of  this  liquid  crystal  phase  of  matter. 

While  this  “history  of  liquid  crystals”  is  cer¬ 
tainly  interesting,  we  elected  to  describe  it  in 
such  detail  in  our  paper  for  another  reason.  Our 
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conference  here  at  the  Technical  University  of 
Denmark  has  as  one  of  its  goals  the  celebration  of 
the  sixtieth  birthday  of  Professor  Alwyn  C.  Scott. 
As  we  learned  about  the  history  of  liquid  crys¬ 
tals,  we  were  struck  by  the  similarities  between 
the  development  of  the  science  of  liquid  crystals 
and  the  manner  with  which  A1  does,  and  advo¬ 
cates  doing,  science:  -  The  development  of  liq¬ 
uid  crystals  was  interdisciplinary  -  between  the 
biological  and  physical  sciences;  international., 
nonlinear,  and  driven  by  experimental,  theoreti¬ 
cal,  and  practical  developments.  A  brief  consid¬ 
eration  of  such  historical  perspectives  certainly 
seems  appropriate  at  our  conference  on  Future 
Directions  of  Nonlinear  Dynamics  in  Physical 
and  Biological  Systems. 

We  conclude  this  historical  section  with  two 
additional  references:  the  book  of  de  Gennes 
[4],  and  the  survey  [5]. 


3.  Theoretical  considerations 

We  are  interested  in  a  coupled  Held  descrip¬ 
tion  of  the  interaction  of  laser  light  with  a  ne¬ 
matic  liquid  crystal.  A  nematic  liquid  crystal 
may  be  thought  of  as  consisting  of  cigar  shaped 
“objects”  which  are  either  polarized  or  easily 
polarizable.  These  objects  have  both  transla¬ 
tional  and  orientational  degrees  of  freedom.  In 
the  liquid  crystal  phase,  the  material  behaves  as 
a  fluid  with  respect  to  its  translational  degrees 
of  freedom;  however,  with  respect  to  its  orien¬ 
tational  degrees  of  freedom,  long  range  order 
is  maintained  as  if  the  material  were  in  a  crys¬ 
talline  phase.  In  this  work  we  will  ignore  fluid 
motion  and  concentrate  upon  the  orientational 
behavior  of  the  liquid  crystal. 

This  orientation  is  described  by  a  director field, 

H{x,t),  |r|  =  1. 

This  unit  vector  « (x,  f )  describes  the  density  of 
“rods”,  at  position  x  at  time  t,  oriented  in  the 
direction  n. 
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Fig.  3.  Schematic  of  the 

“twist”,  “splay",  and  “bend"  elastic 

distortions  of  the  nematic. 

Rotation  of  these  rods  is  opposed  by  elastic, 
nearest  neighbor,  forces.  Three  independent  ori¬ 
entational  distortions  exist  for  the  vector  field 
«(x,/).  These  are  called  “twist”,  “splay”,  and 
“bend”;  they  are  displayed  in  the  cartoons  of  Fig. 
3  [6];  the  energy  densities  associated  with  each 
are  given  by 

fT  =  /^T[ll  (Vxil)]2, 

Ss=/Cs(Vn)^, 

SB  =  KBln  X  (V  X  n)]^, 

where  Kt,  Ks,  and  Kb  are  the  elastic  constants  as¬ 
sociated  with  twist,  splay,  and  bend  respectively. 
The  total  elastic  energy  is  then  given  by 

Eci  =  ij  (^T  +  +  ^b)  d^x. 

Infinitesimal  variations  (subject  to  the  unit  di¬ 
rector  constraint)  of  this  elastic  energy  produce 
an  elliptic  operator  which,  in  the  “one  constant” 
approximation  (K  =  Ks  =  Kb  =  Kj)  reduces 
to  the  Laplacian.  (Typical  nematic  liquid  crys¬ 
tals  have  ATs  <  A^b  ~  ^t-  ) 

We  seek  to  describe  the  interaction  of  this  di¬ 
rector  field  with  laser  light.  First,  note  that  the 
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director  n  is  very  sluggish  in  t  and  can  only  re- 
spond  to  the  time  averaged  electromagnetic  field 
E.  A  phenomenological  description  of  this  inter¬ 
action  is  as  follows:  The  liquid  crystal  has  its  di¬ 
rector  field  R  specified  on  the  boundary  of  the 
sample.  When  the  laser  shines  on  this  sample, 
energy  minimization  favors  the  alignment  of  n 
with  E.  During  this  alignment  procedure,  E  in¬ 
duces  a  dipolar  change  in  the  index  of  refrac¬ 
tion.  The  index  of  refraction  is  maximal  when  n 
is  parallel  to  E,  producing  a  “self-focusing"  lens. 
The  reason  behind  the  large  values  of  the  nonlin¬ 
ear  dielectric  constant  in  liquid  crystals  is  that  it 
is  relatively  easy  to  rotate  the  director  field. 

The  coupled  equations  for  the  electric  field  in¬ 
teracting  with  the  director  field  are  given  by 


-V  X  V  X  £  +  /cq  (£■  -I-  a(n  •  E)n  )  =  0, 
K^nxV^n  +  g9t(n  x  (n  ■  E)E*)^  =0, 

(3.1) 


where  g  denotes  a  coupling  parameter.  Here  we 
note  that  E  is  the  time  averaged  complex  electric 
field  amplitude,  and  that  the  interaction  in  the 
coupled  equations  (3. 1 )  is  invariant  to  n  — >  -n. 

It  will  be  convenient  to  specialize  to  a  simple 
two-dimensional  geometry  which  can  apply  in 
either  the  “film”  or  the  “tube”  cases: 


R  = 


E  = 


sin0  \ 

0  , 

(3.2) 

COS0  J 

0  , 

(3.3) 

H 

where  the  angle  0  =  d(x,z),  and  the  compo¬ 
nents  of  the  field  {F,H)  also  only  depend  upon 
(x,z).  In  this  two  dimensional  geometry,  z  is  the 
longitudinal  coordinate;  x  is  the  transverse  coor¬ 
dinate;  F  and  H  are  the  transverse  and  longitu¬ 
dinal  field  components,  respectively.  If,  at  “lead¬ 
ing  order”,  one  neglects  the  longitudinal  field  H, 


the  equation  for  the  director  takes  the  form  of 
an  elliptic,  variable  coejficient,  sine-gordon  equa¬ 
tion: 

0xx  +  0rr  +  |^|^sin20.  (3.4) 

3. 1.  The  Frederiks  transition 

The  relevant  nonlinear  effect  for  our  studies 
is  known  as  the  “Frederiks  transition”  and  can 
be  easily  seen  from  eq.  (3.4):  One  assumes  an  F 
field  which  is  constant  in  both  x  and  z,  and  seeks 
a  director  field  0  which  is  a  function  of  z  only. 
In  this  situation,  one  reduces  (3.4)  to  the  ode 

0ZZ  +  sin  20  =  0, 

0{z  =  0)  =  0{z  =  L)=0. 


Here  we  have  introduced  a  parameter  X  known 
as  the  Frederiks  transition  length, 

X^\F\-\ 

and  have  introduced  the  boundary  conditions  on 
the  director  that  r  =  £  at  z  =  0  and  z  =  L\ 
that  is,  0  vanishes  at  the  front  and  rear  faces  of 
the  sample.  0  =  0  is  one  solution  of  this  two- 
point  boundary  problem  corresponding  to  align¬ 
ment  with  the  boundary  conditions  of  the  direc¬ 
tor  throughout  the  sample.  On  the  other  hand,  as 
the  intensity  of  the  constant  F  field  is  increased, 
solutions  of  the  two-point  boundary  experience 
a  “pitchfork  bifurcation”  and  a  second  solution 
appears  with  0  =  0(z).  This  second  solution 
represents  a  z  dependent  rotation  of  the  director 
field  within  the  sample,  away  from  its  boundary 
orientation  toward  the  orientation  of  the  i:  com¬ 
ponent  of  the  electric  field.  This  bifurcation  is 
the  primary  nonlinear  effect  behind  our  study. 
We  will  investigate  it  numerically  and  analyti¬ 
cally,  in  the  more  general  situation  of  coupled 
fields  with  both  x  and  z  dependence. 
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S.2.  A  model  problem 

By  scaling  all  lengths  on  the  Frederiks  tran¬ 
sition  length  of  the  incoming  pump  beam,  the 
experimental  value  of  the  optical  wavenumber 
becomes  Ac  ~  189.  (Other  natural  scalings  pro¬ 
duce  much  larger  nondimensional  wave  num¬ 
bers.)  With  such  a  lai^e  wavenumber,  the  elec¬ 
tric  field  is  certain  to  obey  a  slowly  varying  enve¬ 
lope  approximation.  Here  we  study  the  simplest 
such  (two  dimensional)  envelope  model  -  one 
which  incorporates  a  coupling  between  paraxial 
optics  and  nematic  deformation: 

2ikF, -{■  Fxx k^asin^d  F  =  0,  (3.5) 

<?«  +  +  |/=’psin20  =0.  (3.6) 

Here  the  refractive  anisotropy  is  represented  by 
the  parameter  a,  which  is  approximately  0.25  for 
the  experiments  (2,1  ].  Depending  upon  bound¬ 
ary  conditions,  this  model  can  be  used  to  study 
either  the  thin  film  or  the  tube  geometries.  In  the 
thin  film  case,  the  boundary  conditions  are 

F{x,z  =  0)  =  Fin(x) 

0(a:,O)  =d(x,L)  =  0, 

0(x,z)  —>0  asx— >±00.  (3.7) 

(In  the  tube  geometry,  the  latter  boundary  con¬ 

dition  is  replaced  hy  6 {±W,z)  =  0,  where  W  is 
the  width  (radius)  of  the  tube.) 

Heuristically,  eq.  (3.5)  is  a  scalar  paraxial 
equation  that  includes  the  effects  of  diffraction 
and  nematic  anisotropy,  while  eq.  (3.6)  models 
static  nematic  distortions.  Of  course,  this  two 
dimensional  model  neglects  birefringence  (po¬ 
larization)  effects  and  scattering  losses  -  which 
are  important  for  quantitative  detail.  However, 
this  simple  model  has  proven  useful  in  the  de¬ 
velopment  and  description  of  our  mathematical 
approximations. 


4.  An  initial  numerical  study 

We  begin  with  a  brief  summary  of  our  initial 
numerical  study  of  model  (3. 5)- (3.6).  To  set 
up  the  numerical  algorithm  we  impose  Dirichlet 
boundary  conditions  for  the  0  field  in  z,  Neu¬ 
mann  boundary  conditions  for  the  0  field  in  x, 
as  well  as  periodic  boundary  conditions  with  re¬ 
spect  to  the  transverse  variable  x  for  the  F  field. 
The  transverse  width  models  an  extremely  wide 
thin  film,  and  in  practice  is  chosen  wide  enough 
that  it  does  not  effect  the  results.  (The  choice 
of  transverse  Neumann  boundary  conditions  has 
been  made  to  improve  this  insensitivity. ) 

The  parabolic  equation  for  F  is  integrated 
with  an  “integrating  factor”  method  which  ex¬ 
actly  factors  out  the  Laplacian  from  the  evolu¬ 
tion  in  z  by  introducing  analytically  the  factor 
exp  I  (iz/2A: )  ].  The  elliptic  equation  for  the 

0  field  is  integrated  with  a  “relaxation  method”. 
Both  the  “integrating  factor”  and  “relaxation” 
methods  are  implemented  through  “fast  Fourier 
transforms”,  which  are  natural  given  the  bound¬ 
ary  conditions. 

Sample  results  from  these  numerical  experi¬ 
ments  are  displayed  in  Fig.  4.  In  these  numer¬ 
ical  experiments,  k  =  100.00,  q  =  0.10,  and 
the  critical  field  for  the  “Frederiks  transition” 
(below  which  there  is  no  transition)  is  jFcrl 
0.12.  In  Fig.  4a,  the  maximum  field  intensity  is 
(Fmaxl  =  0.20,  somewhat  above  the  transition 
value;  in  Fig.  4b,  the  maximum  field  intensity  is 
larger,  IFmaxI  =  0.50,  well  above  the  transition 
value.  Examination  of  these  figures  shows: 

(i)  The  transverse  profile  of  the  electric  field 
|F|  is  much  narrower  than  that  of  the  nematic 
field  0. 

(ii)  The  contour  lines  of  the  nematic  field  0 
show  transition  regions  (in  z )  near  the  front  and 
rear  faces  of  the  nematic  film.  These  rather  sharp 
transitions  are  particularly  apparent  at  the  lai^er 
input  field  of  Fig.  4b. 

(iii) As  the  electric  field  F  “propagates”  in 
z,  it  “self-focuses”  in  that  its  intensity  increases 
while  its  tranverse  structure  narrows.  Again, 
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Fig.  4.  Sample  numerical  experiments  for  the  scalar  model  problem,  for  a  =  0.10,  A;  =  100.00.  At  these  parameter  values 
the  critical  field  for  the  Frederiks  transition  has  [fcrl^  (0.12)^.  (a)  Pump  beam:  |fin«x|^  =  (0.2)^;  (b)  Pump  beam: 
IFinmP  =  (0.5)^.  In  each  case,  we  display  contour  lines  for  the  nematic  angle  0;  the  electric  field  intensity  profiles  |Fp  vsjc, 
at  the  front  (dotted  curve)  and  rear  (solid  curve)  faces;  the  behavior  of  the  electric  field  intensity  (solid)  and  of  the  nematic 
(dotted)  along  the  center  beam  line;  and  a  tranverse  profile  near  the  rear  face  for  the  electric  field  intensity  (solid)  and 
nematic  angle  (dotted). 
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Fig.  S.  A  sketch  of  the  boundary  layer  behavior  for  the  thin 
film  case. 


this  self-focusing  is  particularly  apparent  for  the 
larger  input  field  of  Fig.  4b.  In  both  cases  a  and 
b,  the  self-focusing  continues  to  the  rear  face  of 
the  film.  It  is  not  clear  from  these  experiments 
what,  if  anything  in  the  absence  of  scattering 
losses,  saturates  this  self-focusing  process. 

Clearly,  for  this  simple  model  problem,  these 
numerical  discretization  effects  could  be  elim¬ 
inated  with  a  finer  resolution  of  the  numeri¬ 
cal  grid.  Furthermore,  one  could  also  further 
stress  the  model  by  running  the  numerical  ex¬ 
periments  at  more  realistic  parameter  values  - 
for  example,  k  could  be  increased  from  100.00 
to  189.00  and  a  from  O.l  to  0.25.  However, 
the  “boundary-layer-self-focusing”  nature  of 
the  phenomena  make  this  solely  numerical  ap¬ 
proach  unrealistic  for  the  actual  system.  Thus, 
even  for  the  model  problem,  it  is  judicious  to 
develop  a  mathematical  analysis  based  upon  a 
boundary  layer  reduction. 


5.  A  boundary  layer  analysis  for  the  thin  film 
case 


used  in  the  analysis  is  a  transverse  boundary  layer 
located  at  the  beam  center.  While  the  nematic 
field  6  has  contour  lines  (see  Fig.  4)  reminiscent 
of  longitudinal  boundary  layer  behavior  near  the 
front  and  rear  faces  of  the  nematic  film,  this 
longitudinal  transition  scale  is  much  longer  than 
the  characteristic  width  of  the  self-focused  elec¬ 
tric  field  F.  In  the  absence  of  scattering  losses, 
the  self-focusing  continually  narrows  the  electric 
field  until  diffraction  saturates  this  process  at  a 
very  narrow  beam.  This  transverse  self-focusing 
is  also  apparent  in  the  numerics  of  Fig.  4b.  In¬ 
deed,  it  is  this  transverse  effect  of  the  electric 
field,  rather  than  the  longitudinal  behavior  of  the 
nematic,  that  limits  numerical  resolution,  and 
introduces  the  small  width  scale  for  our  analysis. 

Let  y  =  y(k)  denote  a  large  parameter  which 
fixes  the  inner,  transverse  beam  scale; 

X  =  yx,  y  »  1. 

Here  the  scaling  function  y{k)  will  be  deter¬ 
mined  later  in  the  analysis;  for  the  moment  it  is 
only  taken  to  be  large.  Next  we  assume  that  in  the 
outer-relaxation  and  in  the  inner-beam  regions, 
the  fields  take  the  following  form: 

J0<^>(z,x)  (outer), 

^  1  0<^>(2,O)  -I-  y-'0<‘’>(z,x)  (inner);  ^ 

/  0  (exp.  small)  (outer), 

^  “ \y‘/2Fh»(z,x)  (inner).  ^  ^ 

When  viewed  from  the  outer  scale,  the  \F\^  field 
appears  as  a  delta  function  located  at  the  beam 
center  x  =  0;  in  turn,  from  eq.  (3.6),  the  6  field 
has  a  jump  discontinuity  in  its  first  derivative 
dxix  =  0,  z).  Thus, 

\F\^^I5{x), 


With  our  thin  film  model  (3.5),  (3.6),  (3.7), 
we  illustrate  a  boundary  layer  approach  for  the 
light-nematic  system.  The  geometry  is  depicted 
in  Fig.  5.  We  emphasize  that  the  boundary  layer 


where 

+  00 

1=  J  IFpdx 

— OO 


+  00 

J  IF^^^l^dx, 

—  OO 
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and  where  the  scaling  of  insures  the  last 
equality. 

In  the  two  outer-relaxation  regions  ( >  0  and 
jc  <  0),  the  electric  field  is  exponentially  small 
and  the  nematic  field  satisfies  Laplace's  equation 

=  0, 

together  with  vanishing  boundary  conditions  at 
z  =  0,  z  =  L,  X  =  ±oo  and  the  jump  condition 
across  x  =  0.  Thus,  d  has  the  Fourier  series 
representation 

=  ^c„exp^-/i^|x|^  sin 

where  the  constants  c„  will  be  determined  by  the 
jump  condition.  Notice  that  because  6^^^  solves 
Laplace's  equation,  the  transverse  half-width 
of  the  nematic  distortion  is  determined  by  the 
thickness  L  of  the  thin  film. 

To  derive  the  jump  condition,  one  integrates 
eq.  (3.6)  across  the  beam  zone,  paying  attention 
to  scales  and  to  the  local  behavior  of  the  fields. 
The  result  is 

0+ 

+  /sin20<^>  =0. 

0-  x=0 

Defining 


2=0  Z=L 

l,>l2>l,Hr 

Fig.  6.  A  sketch  of  the  qualitative  behavior  of  the  nematic 
distortion  at  beam  center  as  would  be  obtained  by  solving 
the  bifurcation  equation  (S.3)  for  C(z\l). 

C(z) .  This  equation  (5.3)  should  be  viewed  as 
an  extension  of  the  Frederiks  bifurcation  equa¬ 
tion  to  the  boundary  layer  situation  of  variable 
field  strength.  It  must  be  solved  numerically.  Its 
solution  will  behave  as  a  pitch-fork  bifurcation 
and,  above  a  critical  intensity  /c,  will  have  the 
qualitative  behavior  sketched  in  Fig.  6.  Note,  in 
particular,  this  nematic  distortion  C(z;/)  con¬ 
tains  the  "longitudinal  transitions”  in  z. 

Turning  to  the  behavior  of  the  electric  and  ne¬ 
matic  fields  in  the  inner-beam  region,  we  express 
the  nematic  equation  (3.6)  in  the  inner  coordi¬ 
nates  and  solve  it  asymptotically  for 


OO 

C(z)  =  0<‘‘’(z,x  =  0)  =  c„  sin 

n=  I 

one  sees  that  the  jump  condition  yields  the  ex¬ 
pression 


^sin  2y]c„sin  ^n^z^j 

OO  /  \ 

=  y]c„sin  ^n^zj. 


which  is  a  nonlinear  equation  which  determines 
the  constants  c„  as  a  function  of  the  beam  in¬ 
tensity  I;  thus,  it  determines  the  nematic  distor¬ 
tion  at  beam  center  as  described  by  the  function 


CO  OO 

0<'’)  =  -^sin(2C)  (//  IFl^dydy 
X  y 

X  y 

J  J  IFI^dydy).  (5.4) 

—  OO  — OO 

Inserting  this  expression  into  the  field  equation 
(3.5)  written  in  inner  variables  yields,  after  re¬ 
moving  an  appropriate  fast  phase  A:5(x,  z), 

iC,  +  |^G„+^sm(2C)««'’C  =  0,  (5.5) 

where 

F^**)  =  GexpdkS). 
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Examination  of  eq.  (5.S)  shows  that  the  nat¬ 
ural  balance  of  diffraction  against  nematic  dis¬ 
tortion  defines  the  scaling 

y{k)  =  k^^\  (5.6) 


With  this  choice  of  y  scaling,  and  introducing  the 
propagation  scale  z  =  k^^^z,  we  obtain  the  lead¬ 
ing  order  equation  which  governs  the  structure 
of  the  inner  beam: 


oo  oo 

iGz  +  {Gjc,  -  \a single)  J \G\^dydy 

X  y 

X  y 

+  j  j  |Fl2d>^d>’|G  =  0. 


(5.7) 


—  OO  —OO 


Several  remarks  should  be  made  about  this  non¬ 
local,  nonlinear  Schroedinger  equation  (5.7) 
which  governs  the  local  structure  of  the  inner 
electric  field: 

(i)  This  nonlinear  Schroedinger  equation  is  a 
Hamiltonian  system,  with  Hamiltonian 


(ii)  It  depends  upon  C{z),  the  nematic  dis¬ 
tortion  at  beam  center,  which  is  determined  from 
the  outer  problem  as  matched  to  the  inner  beam 
via  bifurcation  equation  (5.3). 

(iii)  Even  though  this  NLS  equation  has  coef¬ 
ficients  which  depend  upon  z,  the  intensity  /  is 
independent  of  z  and  can  be  viewed  as  a  param¬ 
eter  which  is  controlled  by  the  input  laser  beam. 
However,  perhaps  the  most  important  conse¬ 
quence  of  this  boundary  layer  analysis  follows 
from  the  combination  of  the  inner  beam  equa¬ 
tion  (5.7)  with  the  scaling  law  (5.6).  Together, 
these  imply  that,  in  the  absence  of  scattering 
losses,  the  self-focusing  process  terminates  on 
the  scale  x  0(y~^  =  k~^l^). 


6.  Conclusion 

In  this  paper  we  have  described  some  initial 
work  on  the  interaction  of  a  laser  beam  with  a 
nematic  liquid  crystal.  Our  interest  in  this  sys¬ 
tem  stems  from  its  very  large  coefficient  of  non¬ 
linearity  which  allows  the  generation  of  complex 
nonlinear  optical  patterns  with  low  power,  CW 
lasers.  Experimentally,  Braun,  Faucheux,  and 
Libchaber  have  studied  this  system  in  three  geo¬ 
metric  configurations  of  film,  droplet,  and  tube 
[2],  [1  ].  Of  particular  importance  in  these  ex¬ 
perimental  studies  are  the  striking  longitudinal 
views  of  interacting  filaments  in  the  cylindrical 
geometry. 

In  this  paper  we  have  described  a  theoretical 
boundary  layer  analysis  of  an  idealized  (two  di¬ 
mensional,  scalar)  model  of  this  coupled  light- 
nematic  system  in  a  thin  film  geometry.  The 
transverse  boundary  layer  analysis  shows  that 
the  nematic  distortion  at  beam  center  is  de¬ 
scribed  by  a  “local  Frederiks  bifurcation  curve” 
which  captures  transitions  in  the  nematic  near 
the  front  and  rear  faces  of  the  film.  The  analysis 
also  produces  a  scaling  function  which  results 
from  “maximal  balance”  between  diffraction 
and  nonlinearity  and  fixes  (in  the  absence  of 
scattering  losses)  the  spatial  extent  of  the  self- 
focussed  beam.  It  also  yields  a  nonlocal  non¬ 
linear  Schroedinger  equation  which  governs  the 
inner  structure  of  the  beam. 

In  other  theoretical  boundary  layer  analyses 
of  this  coupled  system,  we  have  studied  the  two 
dimensional  “tube  geometry”  for  both  a  model 
scalar[l]  and  full  vector  [7]  optical  field.  In 
[  1  ]  we  show  that  in  the  tube  geometry,  the  two 
dimensional  scalar  model  captures  the  essential 
physics  of  both  the  filamentation  and  the  trans¬ 
verse  undulation  of  the  laser  beam.  The  latter  is  a 
new  phenomenon.  In  order  to  describe  polariza¬ 
tion  effects,  which  are  certainly  absent  in  these 
scalar  models,  we  have  studied  a  full  (but  two 
dimensional)  vector  system  in  the  tube  geome¬ 
try  [7].  In  this  study  we  identify  properties  of 
the  undulation  process  that  are  distinctly  polar- 
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ization  effects. 

Taken  together,  these  experimental,  numer¬ 
ical,  and  theoretical  studies  establish  that  the 
light-nematic  system  is  an  excellent  source  of 
complex,  nonlinear  spatial  patterns.  Further¬ 
more,  our  studies  show  that  coupled  nonlinear 
pde  descriptions  of  light-nematic  interactions 
are  indeed  accessible  theoretically,  through 
boundary  layer  asymptotics  implemented  nu¬ 
merically. 
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Travelling  wave  solutions  are  found  to  be  the  natural  nonlinear  modes  of  wide  aperture  two-level  and  Raman  lasers 
for  frequency  detunings  to  the  positive  side  of  the  gain  peak. 


1.  Introduction 

Complex  pattern  formation  is  commonly  observed  in  spatially  extended,  continuous,  dissipative 
systems  which  are  driven  far  from  equilibrium  by  an  external  stress.  Under  the  influence  of  this 
stress,  the  system  can  undergo  a  series  of  symmetry  breaking  bifurcations  or  phase  transitions  and  the 
resulting  patterns  become  more  and  more  complicated,  both  temporally  and  spatially,  as  the  stress  is 
increased.  Examples  abound  in  ordinary  and  binary  fluids,  in  liquid  crystals  and  chemically  reacting 
media  [  1  ].  Optical  systems,  both  passive  and  active,  are  no  exception  and  considerable  effort  has  been 
expended  recently  to  predict  [2,  3]  and  observe  [4,  5]  pattern  forming  instabilities  in  both  passive 
and  active  nonlinear  optical  systems. 

The  plan  of  the  article  is  as  follows.  In  the  next  section  we  introduce  the  laser  models,  compare 
and  contrast  their  physical  characteristics  and  briefly  review  their  bifurcation  behavior.  An  important 
conclusion  will  be  that  both  laser  models  admit  an  exact  finite  amplitude  travelling  wave  solution 
for  positive  detuning  (Q )  of  the  laser  and  a  homogeneous  solution  for  negative  detuning.  A  weakly 
nonlinear  analysis  near  threshold  for  lasing  in  the  following  section  yields  a  coupled  set  of  amplitude 
equations  of  the  complex  Newell-Whitehead-Segel  type  when  >  0.  Linear  stability  analysis  of 
this  latter  set  shows  that  the  travelling  wave  solution  arises  from  an  instability  of  a  standing  wave. 
The  very  simple  form  of  this  exact  solution  allows  us  to  extend  the  analysis  beyond  threshold  and 
derive  an  equation  for  the  evolution  of  the  phase.  This  yields  analytical  expressions  which  predict  the 
coexistence  of  stable,  Eckhaus  and  zig-zag  unstable  regions  beyond  lasing  threshold.  We  next  provide 
some  preliminary  numerical  results  which  confirm  that  the  wide  aperture  Raman  laser  exhibits  weakly 
turbulent  behavior,  whereby  instabilities  on  an  underlying  travelling  wave  can  spontaneously  nucleate 
seas  of  optical  vortices  and  oU-like  structures.  The  physical  manifestation  of  the  travelling  wave 
solution  is  a  strong  off-axis  emission  of  the  laser.  Finally  we  conclude  with  some  comments  on  ongoing 
and  future  work. 


2.  Two-level  and  Raman  lasers:  background  theory 

The  essential  difference  between  the  two  lasers  lies  in  the  method  of  pumping  employed  in  order  to 
achieve  population  inversion.  Figure  1  shows  a  schematic  of  the  energy  level  schemes  and  pumping 
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(a)  TWO-LEVEL  LASER  (b)  RAMAN  LASER 

LEVEL  SCHEME  LEVEL  SCHEME 


(a)  TWO-LEVEL  LASER  (b)  RAMAN  LASER 


Fig.  1.  Energy  level  diagrams  depicting  pumping  schemes  Fig.  2.  Stability  and  instability  regions  for  the  nonlasing 
for  a  (a)  two-level  and  (b)  Raman  laser.  solution  in  (J$,  re)  parameter  space,  (a)  Two-level  and  (b) 

Raman  laser. 


mechanisms  for  both  lasers.  The  pump  is  the  principal  stress  parameter  for  the  problem.  Inversion  for 
lasing  in  a  two-level  laser  is  created  via  incoherent  pumping  (electrical/flashlamp  discharge,  collisions, 
etc.)  whereas,  in  the  Raman  laser,  a  classic  three-wave  interaction  involving  two  optical  and  one 
material  wave,  introduces  a  strong  coherence  between  the  pump  wave  (amplitude  A )  and  the  laser 
emission  field  (amplitude  e ) .  Figure  1  a  shows  incoherent  pumping  of  a  broad  upper  manifold  of  levels 
with  subsequent  decay  to  form  an  excess  of  population  in  the  upper  lasing  level  [2).  In  the  Raman 
laser  depicted  in  fig.  lb,  the  external  pump  laser  (A)  can  be  detuned  either  above  or  below  (<5  <  0 
or  >  0)  an  intermediate  dipole  coupled  level  (2)  and  the  laser  emission  field  (e)  is  generated  via  the 
three-wave  interaction.  Mirror  optical  feedback  in  both  cases  ensures  that  the  finite  lasing  emission 
field  (c)  will  build  up  from  noise  if  the  external  stress  parameter  r  exceeds  some  critical  value  Kq. 
The  distinction  between  incoherent  and  coherent  pumping  ensures  that  even  the  simple  single  mode 
Raman  laser  exhibits  much  richer  nonlinear  dynamical  behavior  than  its  two-level  counterpart  [6]. 

The  mathematical  description  of  both  lasers  derives  directly  from  the  Maxwell  equations  for  the 
optical  fields  and  the  appropriate  material  density  matrix  equations.  When  appropriately  scaled,  the 
resulting  Maxwell-Bloch  laser  equations  can  be  cast  in  the  form  of  a  generalized  set  of  complex  Lorenz 
equations  for  each  laser. 

Case  A.  Two-level  laser: 


ex-  iaV^e  = -oe op,  px -i- {I \Q)p  =  (r  -  n)e,  nx -i- bn  =  {(e*p  +  ep*),  (1) 

with  Q  =  -ds- 
Case  B.  Raman  laser: 

ex  -  iaV^e  =  -oe  op  \d\en,  a  +  (I  ■^  iQ)p  =  {r  -  n)e  -j-  iS2\e\^p, 

nx-\-  bn  =  ^(e*p  -f-  ep*),  (2) 


with  £2  =  -As  -h  rdy. 


J.  V.  Moloney  et  ai.  /  Space-time  complexity 


129 


The  complex  laser  emission  field  is  denoted  by  e,  the  complex  polarization  field  by  p  and  the 
real  population  inversion  by  n.  The  parameter  <j  is  proportional  to  the  cavity  loss,  b  is  the  ratio  of 
the  polarization  dephasing  to  inversion  decay  rates,  J*  measures  the  laser  cavity-atomic  transition 
frequency  detuning  and  r,  the  principal  stress  parameter  of  the  system,  depends  on  the  nature  of 
the  pumping.  Notice  that  if  the  parameters  Si,i  =  1,2, 3  appearing  in  eqs.  (2)  are  set  to  zero,  both 
problems  are  mathematically  identical.  The  transverse  Laplacian  operator  =  (d^jdx^  +  d^fdy^) 
includes  the  transverse  degree  of  freedom  allowing  for  pattern  formation.  The  coefficient  a,  which 
is  the  inverse  of  the  Fresnel  number,  measures  the  characteristic  width  of  the  transverse  aperture  of 
the  laser.  The  most  significant  new  physics  in  the  Raman  laser  is  the  A.C.  Stark  shift,  which  appears 
in  the  set  (2)  as  a  nonlinear  detuning  term  proportional  to  S2.  As  mentioned  earlier,  the  additional 
terms  proportional  to  <J,,  /  =  1,2,3  appearing  in  the  Raman  laser  model,  substantially  modify  the 
bifurcation  behavior  of  even  the  simplest  single  mode  laser  [6],  making  it  one  of  the  earliest  laser 
systems  showing  evidence  for  chaotic  dynamics  [7]. 

Some  of  the  more  important  simple  solutions  to  eqs.  (1 )  and  (2)  and  their  associated  bifurcation 
behavior  will  now  be  briefly  reviewed.  Both  models  admit  a  trivial  (nonlasing)  solution  ie,p,  n )  =  0, 
whose  linearization  differs  only  in  the  dependence  of  the  detuning  parameter  Q  on  the  stress  parameter 
r  for  the  Raman  laser  (case  B).  The  stability  characteristics  of  this  solution  are  succinctly  captured  in 
the  (Js.  ''c)  plane,  as  shown  in  fig.  2  for  both  cases.  The  laser  oscillation  frequency  and  stress  parameter 
at  onset  of  lasing  are  given  by  the  formulas 

i/c  =  (CTi2c  +  a/:^)/((T  +  1),  Tc  =  1  +  (f2c  -  +  1  (3) 

with  Qc  =  -ds  (case  A)  and  Qc  =  -ds  +  r^S-i  (case  B).  An  important  point  to  note  is  that  for 
<  0,  the  homogeneous  state  (fc  =  0)  has  the  lowest  threshold  (rc  =  1  ■¥  Q}l {a  1)^),  whereas 
for  >  0  the  mode  with  Uq  —  ±v/f2c/a  has  the  lowest  threshold.  The  formula  for  the  laser  frequency 
i^c  which  is  a  simple  generalization  of  the  classic  frequency  pulling  formula,  has  the  obvious  physical 
interpretation  that,  for  >  0  the  laser  will  seek  to  develop  some  transverse  pattern  that  minimizes 
the  detuning,  thereby  maximizing  the  emission.  In  a  large  aspect  ratio  system  one  might  anticipate 
the  spontaneous  formation  of  local  patches  of  patterns  with  random  relative  orientations. 

When  r  >  r^,  the  two-level  laser  undergoes  a  supercritical  (Hopf)  bifurcation  to  a  spatially  homo¬ 
geneous  lasing  state  for  <  0  or  to  a  stable  travelling  wave  state  {e,p,n)  =  (ee‘®,pe‘®, «),  6  = 
k  •  X  +  0)1  for  Q  >  0  with 

e  =  y/b(r  -  rc)  ,  p  =  [  \  -  i(ucla)y/b{r  ~  rc)]e"'^‘ ,  n  =  r  - 

and  o)  =  -((T+l)“'((Tf2-|-a/c^).Inthe  Raman  laser  a  similar  transition  to  a  homogeneous  ( <  0 ) 
or  travelling  wave  (f2  >  0)  state  is  observed  although  in  this  case  the  bifurcation  may  be  sub-  or 
supercritical.  The  lasing  emission  intensity  (can  be  assumed  real)  is  now  given  as  the  solution  to 
the  following  quadratic  equation  in 

2{Q-  ak^)  -  (^2)  -I-  -  ak^)^  -  {r  -  \)(\  +  a)^ 

=  0 


with 
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15  >  t  -J  -« _ 1  1  -1—  I  I  I  A 

0  2  4  «  I  to  12  U  1«  It  A 

wave  vector  k 


Fig.  3.  Existence  domain  for  travelliog  wave  solutions  and 
their  associated  stability  properties  for  a  Raman  laser  «dth 
Q  >0. 


a)=(<r+l)  ^[aS2e^ -aQ -ak^  S\e^lb\,  n^e^jb,  p  =  e(l+ia). 


where 


a  =  (a  +  \  )-^[e^{d2-  d^/b)  -  {Q  -  ak^)l 

It  is  a  straightforward  matter  to  show  that  the  bifurcation  to  this  state  will  be  subcritical  if  the  condition 
2  (Q -ak^ )  (<5i  /^  -^ )  +  (<r  +  1  ^  )/^  <  0  is  satisfled.  Moreover,  it  can  be  shown  using  weakly  nonlinear 
analysis  near  onset  that  this  exact  travelling  wave  solution  appears  as  an  instability  of  a  standing 
wave.  This  point  is  discussed  briefly  in  the  following  section.  Numerical  integration  of  both  sets  of 
laser  equations  ( 1 )  and  (2),  for  one  transverse  spatial  dimension  and  a  variety  of  initial  conditions 
suggests  that  the  above  travelling  wave  with  /cq  =  y/Sifa  is  a  globally  attracting  state  of  the  system 
for  >  0  [8]. 

For  illustrative  purposes,  we  show  in  fig.  3  the  neutral  stability  curve  in  the  {r,k)  plane,  indicating 
the  region  of  existence  of  the  travelling  wave  solution  and  its  stability  characteristics  for  the  case  of 
the  Raman  laser  (case  B).  Observe  the  coexistence  of  Eckhaus,  zig-zag  unstable  and  stable  travelling 
wave  domains.  Our  parameters  were  chosen  to  be  identical  to  those  used  in  ref.  [6  ]  with  the  additional 

diffraction  parameter  a  =  0.05.  Along  the  curve  ( - )  the  bifurcation  is  subcritical  and  the  curve 

(...)  denotes  the  maximum  emission  amplitude  of  the  laser  satisfying  Q  =  ak^.  Note  also  that  the 
lasing  emission  threshold  for  the  travelling  wave  is  substantially  lower  than  for  the  homogeneous  state. 
The  Eckhaus,  zig-zag  and  stable  boundaries  can  be  determined  analytically  from  the  phase  equation 
presented  in  the  following  section.  We  remark  that  the  relative  disposition  of  these  boundaries  depends 
strongly  on  the  sign  of  6.  The  stable  travelling  wave  region  is  the  optics  analog  of  the  Busse  balloon 
relevant  to  pattern  formation  in  large  aspect  ratio  fluids.  A  further  comment  is  in  order  with  regard 
to  the  distinction  between  ID  and  2D  instabilities  in  these  laser  systems.  In  ID,  the  zig-zag  instability 
is  absent  and  only  the  Eckhaus  remains.  We  therefore  expect  to  observe  a  fundamental  difference 
between  1 D  and  2D  pattern  forming  instabilities.  We  will  explore  further  the  natures  of  the  2D  pattern 
forming  instabilities  when  we  discuss  some  numerical  solutions  to  eqs.  (2)  later. 
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Fig.  4.  Neutral  stability  curves  for  (a)  12  >  0  and  (b) 
12  <  0,  applicable  to  both  types  of  laser. 


3.  Near  and  beyond  threshold  behavior  amplitude  and  phase  equations 


Neutral  stability  curves,  corresponding  to  negative  and  positive  detuning  respectively,  are  sketched 
in  figs.  4a  and  4b  and  apply  to  both  laser  models. 

We  now  specialize  to  the  Raman  laser  (case  B)  and  assume  12  >  0  as  this  case  offers  the  richest 
phenomenology.  Carrying  out  the  weakly  nonlinear  analysis  near  threshold  along  the  lines  described 
in  refs.  [8,  9]  for  the  two-level  laser,  we  arrive  at  the  following  coupled  complex  Newell- Whitehead- 
Segel  equations  describing  pattern  formation  near  the  onset  of  lasing. 


=  cr(r-  1)J5,  (1  +  iJj)  -  -i  (<52  +  +  2|52p 


ia+  1) 


dB2 


dt 

=  a(r  -  1  )B2 


-  [QdB2  .  _2d 
-2aY-— -iaV2fi2  + 


dx 


{o  +  iy 


(1  +  ijj)  -  (i  -  i  (<52  +  (1^21^  +  m\^) 


(4) 


with  Q  =  -As  +  ^3  >  0.  Setting  Si  =  0,i  =  1,2,3,  we  recover  the  coupled  complex  Newell- 
Whitehead-Segel  equations  for  the  two-level  laser  near  onset  given  as  eq.  (6.246)  in  ref.  [9],  These 
coupled  amplitude  equations  are  generally  valid  near  threshold  for  the  two-level  laser  with  Q  >  0 
but  their  validity  is  restricted  to  the  case  where  the  bifurcation  in  the  Raman  laser  near  threshold  is 
supercritical  and  12  >  0.  It  can  be  shown  for  both  cases  that  the  standing  wave  solution  BiB\  =  jb  is 
unstable  to  a  travelling  wave  solution  [8].  These  latter  waves  are  seen  to  move  with  the  group  velocity 

Ug  =  ±(2>/a^/(T  +  1). 

By  using  the  exact  travelling  wave  solution  above  threshold  (e,p,n)  =  (ce‘®,pe'®,n),  and  assuming 
that  the  amplitude  is  slaved  to  the  phase,  we  can  derive  an  evolution  equation  for  the  phase  0  itself, 

t(H  1 -j-Ui  +^r)-^  +  ae{xi  -Fxp  +  ^s{k)-^k^  \ 

—  ^^■^^y(jir|  AT]  )€dyy. 
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where 


xi  +  X*  =  i2e/a)[a  -h  {bS2-Sx/a)a^  -  (b52  +  ^i/ct)  -  ISia/a)], 


x(k)  =  +  2a(W2-<5i)l, 


This  rather  complicated  expression  enables  us  to  determine  analytically  the  boundaries  for  phase 
instabilities  and  compare  those  with  results  obtained  from  the  exact  problem  and  amplitude  equation 
approximation. 


4.  Numerical  simulations  beyond  threshold 

We  have  numerically  integrated  eqs.  (2)  for  the  Raman  laser  using  parameter  values  which  yielded 
fig.  3,  for  a  variety  of  initial  conditions  both  without  and  with  a  weak  external  probe  beam.  The  idea 
with  the  external  probe  is  that  it  may  be  possible  to  injection  lock  local  spatial  domains  of  the  wide 
aperture  laser  in  a  controllable  manner.  Choosing  the  pump  parameter  A  =  3.0  (see  fig.  3)  and  initial 
data  corresponding  to  a  plane  travelling  wave  with  wavenumber  lying  in  the  stable  band,  confirmed 
that  the  travelling  wave  was  indeed  stable.  If  a  weak  external  probe  is  applied  so  as  to  favor  the  zig-zag 
instability,  which  occurs  at  right  angles  to  the  travelling  wave  direction,  we  observe  the  appearance  of 
a  rather  robust  solitary  wave  train  as  an  alternating  dark/bright  pattern. 

Figures  5  A  and  5B  show  a  sequence  of  frames  from  a  movie  of  the  near-  and  nsr-field  output  cross- 
section  of  the  laser,  when  the  pump  A  =  3.0  and  the  field  is  initiated  from  noise.  The  sequence 
of  frames  (a)-(f)  in  fig.  5A  shows  the  build-up  from  noise  (random  pattern)  to  a  finite  amplitude 
emission  consisting  of  a  sea  of  optical  vortices  and  bright  ridges  which  arise  from  a  combination  of 
Eckhaus  and  zig-zag  instabilities.  The  bright  ridges  are  aligned  at  right  angles  to  the  direction  of  the 
travelling  wave  which  is  moving  from  the  top  left  to  bottom  right  of  each  frame.  This  complicated 
spatio-temporal  pattern  persists  indefinitely  in  time  with  no  sign  of  any  regular  recurrence.  The  far- 
field  emission  shows  the  initial  appearance  of  a  weak  far-field  ring  with  radius  equal  to  the  value 
k  =  kc  at  threshold.  As  the  amplitude  grows  the  ring  expands  slightly  consistent  with  the  fact  that  the 
wavenumber  k  increases  as  one  moves  above  threshold.  The  ring  in  the  far-field  signifies  the  absence 
of  any  preferred  direction  due  to  the  degeneracy  in  the  local  plane  wave  directions.  Once  the  amplitude 
becomes  significant  the  travelling  wave  starts  to  choose  a  fixed  direction  and  the  emission  adjusts  to  a 
reasonably  well-defined  spot  off-axis.  This  off-axis  spot  remains  intense  with  a  randomly  fluctuating 
lower  amplitude  background  indicative  of  a  weakly  turbulent  state  of  the  laser  emission. 
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5.  Conclosioiis 

Both  two-level  and  Raman  wide  aperture  lasers  are  capable  of  displaying  a  rich  variety  of  pattern 
forming  instabilities.  The  nonlinear  detuning  term  in  the  Raman  laser  is  responsible  for  promoting 
a  broader  range  of  instability  behavior  than  can  occur  in  the  two-level  laser.  The  nature  of  the  in¬ 
stability  depends  on  the  transverse  dimension  of  the  laser  system  and  it  is  anticipated  that  much  of 
the  spatio-temporal  behavior  of  these  lasers  will  carry  over  to  technologically  important  wide  aper¬ 
ture  semiconductor  laser  systems.  The  latter  systems  require  a  much  more  complicated  material  de¬ 
scription,  involving  many-body  interactions  between  carriers  and  holes  at  the  microscopic  level  and 
a  major  challenge  that  remains  is  to  derive  order  parameter  equations  near  onset  that  are  capable  of 
predicting  and  suggesting  means  of  stabilization  of  novel  pattern  shapes  in  a  broad  area  and  vertical 
cavity  surface  emitting  semiconductor  lasers. 
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We  present  two  generalizations  to  standard  optical  coupler  configurations.  Both  cases  show  improved  switching 
characteristics. 


The  switching  characteristics  of  nonlinear  di¬ 
rectional  couplers  have  been  the  object  of  at¬ 
tention  since  the  work  of  Jensen  ( 1  ]  and  Maier 
[2].  These  authors  demonstrated  the  possibil¬ 
ity  of  an  all-optical  switching  in  a  two  wave¬ 
guide  device  based  on  the  occurrence  of  power- 
triggered  switching  (power  self-trapping).  The 
present  study  will  focus  on  means  of  improving 
the  switching  profile  and  decreasing  power  lev¬ 
els  required  for  the  device  to  operate  by  consid¬ 
ering  direct  extensions  of  Jensen’s  original  idea. 
Consider  a  one-dimensional  array  of  N  (differ¬ 
ent)  nonlinear  directional  couplers.  The  nonlin¬ 
ear  waveguide  medium  is  assumed  to  be  of  the 
Kerr  type,  i.e.,  with  refractive  index  given  by 
m  =  noi  -I-  n2i\Ei\^  where  /loi  is  the  linear  in¬ 
dex  of  refraction  of  the  fth  guide,  «2,  is  its  Kerr 
coefficient  and  £,  is  the  electric  field  inside  the 
guide.  Treating  the  system  in  the  coupled-mode 
approximation  and  restricting  each  waveguide 
to  separately  support  a  single  mode  only  and  ne¬ 
glecting  cross-phase  modulation,  the  equations 
for  the  mode  field  amplitudes  a„  are 

i-^=c«an  -I-  F„„_,exp[i(<^„_, -0„)z]a„_, 

+  f^n,n+iexp[i((^„+i  -4)„)z]an+\ 

-Qn\an\^a„,  (1) 

where  n  =  1, 2, . . . ,  iV,  e„  is  a  self-energy  term, 
V„,„-\  and  are  nearest-neighbor  coupling 
coefficients.  The  nonlinear  coefficient  Q„  is  the 


self-phase  modulation  coefficient,  the  “phase” 
<l)„  is  the  wavenumber  of  the  field  along  the  direc¬ 
tion  of  propagation  in  the  nth  waveguide  and  P  is 
the  total  power  injected  into  the  waveguide  sys¬ 
tem  at  z  =  0.  Power  conservation  (52n  l^nl^  = 
P)  requires  The  transforma¬ 

tion  c„  =  a„  e\p{i(p„ z)/VP  reduces  system  ( 1 ) 
to 

~Xn\(^n\^Cn,  (2) 

where  d„  =  e„  -  <j>„,  Xn  =  PQn  and  Y,n  = 
1.  Note  that  in  eq.  (2)  the  “phases”  <t>n  play  the 
role  of  local  “self-energy”  terms  or  local  “site  en¬ 
ergies”. 

System  (2)  is  a  fairly  general  form  of  the  dis¬ 
crete  nonlinear  Schrodinger  equation  (DNSE) 
or  discrete  selftrapping  equation  introduced  by 
Eilbeck,  Lomdahl  and  Scott  in  the  context  of 
coupled  nonlinear  oscillators  [3].  In  most  cases 
of  physical  interest,  systems  of  identical  wave¬ 
guides  are  considered,  in  which  case  <5„  =  <5, 
yn,n-\  =  Kn+i  =  V  and  Xn  =  X-  In  Jensen’s 
two  identical  coupler  device  (A'^  =  2),  in  partic¬ 
ular,  and  for  the  initial  condition  which  places 
all  the  initial  power  in  one  of  the  guides  (say 
guide  1),  the  exact  solution  shows  the  pres¬ 
ence  of  a  “dynamical”  self-trapping  transition 
[  1 ,2,4  ] .  For  values  x  /4  F  <  1  there  is  a  complete 
transfer  of  power  between  the  two  guides  in  a 
trigonometric-like  fashion,  while  for  x/4F  >  1, 
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Fig.  I.  Nonlinear  two-waveguide  with  phase  mis¬ 
match:  space-averaged  transmittance  difTerence  Ti  -  T2 
=  (kil^)  -  (k2l^)  versus  power  (x)  for  several  phase  mis¬ 
match  values  (d)  (F  =  I). 


the  transfer  is  incomplete  and  the  power  re¬ 
mains  trapped  in  the  original  waveguide.  Let  us 
now  consider  a  simple  extension  of  the  above 
system:  two  guides  with  different  linear  indices 
of  refraction  but  rather  similar  Kerr  coefficients. 
Since  the  waveguides  are  dissimilar  in  this  case 
the  propagation  properties  of  the  fields  in  the 
two  guides  are  different,  resulting  in  a  non-zero 
phase  mismatch  <{>2  -  <t>i  ^  0  in  the  system  of 
equations  (1).  The  resulting  set  of  equations 
is  deceivingly  complicated  since  these  equa¬ 
tions  can  be  written  equivalently  in  the  form 


presented  in  eq.  (2),  leading  to 

=  ^C2 -x\c\\^cu 

(3) 

=  dC2  -1-  VCx  -X\C2\^C2. 

(4) 

Assuming  for  simplicity  ei  =62  =  0  we  ob¬ 
tain  J  =  <52  -  ^1  =  -  <A2  as  the  effective 

mismatch  between  the  two  guides.  Equations 
(3),  (4)  represent  the  exact  equations  of  mo¬ 
tion  for  a  nonlinear  nondegenerate  dimer  [5,6]. 
Figure  1  shows  numerical  simulation  results  for 
the  space-averaged  difference  in  transmittance 
(kiP)  -  {k2p)  as  a  function  of  x  (proportional 
to  the  input  power  P)  for  several  values  of  A 
for  a  (0)  =  1  and  C2(0)  =  0.  For  d  =  0  we 
have  the  degenerate  dimer  self-trapping  transi- 


Fig.  2.  Dynamical  phases  for  the  DNT  system  as  a  func¬ 
tion  of  the  linear  coupling  W  and  power  x-  The  regimes 
labeled  “free”,  “chaotic”  and  “self-trapped”,  refer  to  com¬ 
plete,  chaotic  and  incomplete  power  transfer,  respectively. 
The  top  half  of  the  figure  displays  the  case  for  V  and  W 
negative  while  the  lower  half  is  for  V  and  W  positive  [71. 


tion  at  ;f/4F  =  1.  For  A  <  0,  the  position  of 
the  transition  shifts  to  lower  power  levels,  while 
for  J  >  0,  the  tendency  is  reversed  and  more 
power  is  required.  A  phase  diagram  illustrating 
the  different  regimes  of  behavior  can  be  found 
in  ref  [6]. 

Another  simple  variant  of  the  Jensen  model 
is  the  doubly  nonlinear  trimer  (DNT)  system: 
two  identical  nonlinear  waveguides  interacting 
with  a  third  linear  guide  in  a  closed  configura¬ 
tion  [7]  (61  =  62  =  63,  xi  =  X2,X3  =  0).  In 
practice  this  could  be  achieved  by  placing  three 
waveguides  in  a  triangular  array  with  (nonlin¬ 
ear)  guides  1  and  2  in  close  proximity  and  (lin¬ 
ear)  guide  3  symmetrically  below  the  mid  point 
of  the  line  joining  the  other  two.  The  equations 
for  the  field  amplitudes  are 

=  VC2+WC3-x\Ci\^Cu  (5) 
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=  KC,  +  ^^C3-X|C2l^C2,  (6) 

dz 

=  JF(C,  +C2),  (7) 

dz 

where  IV  is  the  coupling  to  the  linear  guide.  Nu¬ 
merical  simulations  show  that  the  self-trapping 
transition  is  sustained  in  the  presence  of  the  third 
linear  guide.  Furthermore,  by  a  judicious  choice 
of  the  value  of  W,  the  transition  can  be  tuned  to 
occur  at  much  lower  input  power  levels  (fig.  2). 

The  study  of  simple  extensions  and  modifi¬ 
cations  to  Jensen’s  original  device  demonstrates 
that  it  is  indeed  possible  to  improve  on  the  char¬ 
acteristics  of  that  device.  In  particular,  lower 
power  and  reasonably  abrupt  switching  are  pos¬ 
sible  in  the  context  of  the  models  we  described 
above.  The  attainment  of  these  desirable  features 
is  a  positive  step  in  the  direction  of  the  fabrica¬ 


tion  of  ultrafast  all-optical  switching  devices  at 
reasonable  power  levels. 
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The  quantum  inverse  scattering  method  is  used  to  solve  the  spectral  problem  of  the  discrete  self-trapping  dimer, 
in  the  case  of  both  a  quadratic  and  a  linear  r-matrix  algebra  representation.  The  first  case  is  solved  by  the  algebraic 
Bethe  ansatz,  while  in  the  case  of  the  linear  r-matrix  algebra  we  use  the  method  of  separation  of  variables.  In  this  last 
case  it  is  shown  that  the  wave  functions  of  the  quantum  discrete  self-trapping  dimer  are  related  to  the  solutions  of  a 
Heun’s  type  equation  and  that  the  system  is  equivalent  to  the  two-site  hyperbolic  Gaudin  magnet  separable  in  elliptic 
coordinates. 

1.  Introduction 

During  the  past  years  the  quantum  inverse  scattering  method  (QISM)  has  been  shown  to  be  a  very 
powerful  tool  in  the  field  of  quantum  integrability.  This  method  was  used  to  perform  the  exact  quan¬ 
tization  of  several  nonlinear  field  theories  of  physical  interest  such  as  nonlinear  Schrodinger,  sine- 
Gordon  etc.,  and  provides  a  unifying  bridge  with  the  works  of  Bethe,  Yang,  Baxter,  etc.  on  solvable 
lattice  models  of  statistical  physics  [7,13].  Besides  this,  the  QISM  has  shown  to  have  interesting  links 
with  the  theory  of  factorizable  5-matrices,  braid  groups  and  the  newly  bom  discipline  of  quantum 
groups.  In  the  context  of  quantum  integrable  systems  the  QISM  is  used  to  find  the  common  eigen¬ 
vectors  and  the  eigenvalues  of  the  quantum  integrals  of  motion.  The  method  consists  in  including  a 
commutative  subalgebra  of  operators,  corresponding  to  the  quantum  integrals  of  motion,  in  some  as¬ 
sociative  algebra  of  generators  Tap  (m)  {a,p  =  l,...,t/)  satisfying  the  following  quadratic /{-matrix 
relation: 

R(u-v)Tx(u)T2(v)  =  T2{v)Ti(u)R(u-v),  (1.1) 

or  the  linear  r-matrix  relation 

[7',(«),7’2(z;)]  =  lr{u-v),Ti(u)  +  T2{v)],  (1.2) 

with 

7,  (w)  =  r(M)®  id,  72(7;)  =  id®r(u).  (1.3) 

Here  7(m)  denotes  a  (d  x  d)  matrix  with  q-numbers  (operator)  elements,  id  is  the  unit  {d  x  d)- 
matrix,  Riu)  and  r(M)  are  (d^  x  d^)  c-number  matrices  which  satisfy,  respectively,  the  quantum  and 
classical  Yang-Baxter  equations  [7].  The  matrix  7(m)  is  the  monodromy  matrix  (or  Lax  operator) 
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and  the  commutative  subalgebra  is  simply  the  one  generated  by  the  trace  of  T(u)  in  the  quadratic 
case  and  by  the  trace  of  T^(u)  in  the  linear  case.  Sometimes  it  is  possible  to  include  the  family  of 
integrals  of  motion  into  both  linear  and  quadratic  algebras.  In  this  case  two  different  Lax  operators 
for  the  system  must  exist.  Below  we  shall  restrict  ourselves  to  the  case  d  =  2.  By  fixing  i?  and  chosing 
a  representation  of  the  associative  algebra  (1.1)  (or  (1.2))  one  has  an  integrable  quantum  system 
whose  spectral  problem  can  be  solved  by  purely  algebraic  means. 

The  method  of  solution,  however,  may  depend  on  the  particular  representation  (L-operator)  chosen 
for  the  algebra  of  generators.  Two  principal  methods  exist  to  solve  this  problems.  The  first  one  is  the 
so-called  algebraic  Bethe  ansatz  (BA).  This  technique  is  only  applicable  when  the  representation  of 
the  algebra  of  generators 


r(u) 


(  A(u)  B{u)  \ 
\C(u)  D{u)) 


(1.4) 


does  possess  the  highest  vector  |0)  (pseudo vacuum)  such  that 


B|0}  =  0.  (1.5) 

The  application  of  this  method  however  does  not  automatically  guarantee  the  completeness  of  the  ob¬ 
tained  solutions  (it  is  however  believed  that  the  cases  in  which  completeness  problems  arise  constitute 
a  set  of  zero  measure). 

The  second  method  is  the  functional  BA,  also  called  the  method  of  separation  of  variables,  which 
was  originally  introduced  by  Sklyanin  [20,21  ].  This  method  does  not  depend  on  the  existence  of  the 
pseudovacuum  and  has  the  advantage  of  avoiding  the  completeness  problems. 

The  aim  of  the  present  paper  is  twofold.  First,  we  show  on  a  concrete  example  how  one  uses  in  the 
QISM  the  above  methods  to  calculate  the  eigenvalues  and  the  common  eigenvectors  of  the  quantum 
integrals  of  motion.  To  this  end  we  will  use  as  a  working  example  the  quantum  discrete  self-trapping 
(QDST)  dimer  [3,2, 1 9, 1 8  ].  This  system  was  introduced  by  Scott  and  has  been  shown  to  be  completely 
integrable  in  terms  of  a  Lax  pair  related  to  a  quadratic  B-matrix  algebra  [5].  In  this  representation 
there  exists  the  pseudovacuum  so  that  the  algebraic  BA  can  be  applied.  Second,  we  present  new  results 
on  the  QISM  analysis  of  the  QDST  dimer.  More  precisely,  we  show  that  it  is  possible  to  construct 
a  different  Lax  pair  for  the  QDST  dimer  which  is  related  to  a  linear  r-matrix  algebra.  In  this  case 
the  representation  space  does  not  contain  the  pseudovacuum  state  so  that  the  method  of  separation 
of  variables  must  be  used.  These  features  make  the  QDST  dimer  to  be  an  ideal  system  on  which  the 
different  ansatze  used  in  the  theory  of  quantum  integrability  can  be  tested  and  compared.  The  proposed 
paper  is  devoted  exactly  to  the  discussion  of  these  questions.  As  a  result  we  show  that  in  the  case 
of  linear  r-matrix  algebra  the  resonant  QDST  dimer  is  equivalent  to  the  hyperbolic  Gaudin  magnet 
[8]  and  the  eigenfunctions  of  the  integrals  of  motion  are  expressed  in  terms  of  generalized  Hermite 
polynomials  or  confluent  Heun’s  polynomials  [  1 , 1 0, 1 2, 1 6, 1 7,23,24 ].  To  this  end  we  use  the  technique 
developed  in  refs.  [  1 1,14,15]  to  show  that  the  QDST  dimer  separates  in  terms  of  elliptic  coordinates. 
We  note,  that  confluent  Heun’s  polynomials  also  appeared  in  the  quantum  two-dimensional  Coulomb 
problem  which  was  presented  in  details  in  the  book  [12]  and  we  think  it  is  remarkable  that  the 
eigenfunctions  of  the  QDST  dimer  are  expressed  in  terms  of  them.  This  result  is  also  compared  with 
a  direct  solution  of  the  eigenvalues  problem  of  the  QDST  dimer  in  terms  of  su(2)-variables. 

The  paper  is  organized  as  follows.  In  section  2  we  recall  the  quadratic  B-matrix  algebra  for  the 
QDST  dimer  and  the  algebraic  BA  is  briefly  reviewed.  In  section  3  a  new  Lax  operator  related  to  a 
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linear  r-matrix  algebra  is  introduced.  In  section  4  we  show  how  the  method  of  separation  of  variables 
allows  to  solve  the  spectral  problem  of  the  QDST  dimer.  Finally,  in  the  last  section,  we  check  the 
results  of  section  4  by  expressing  the  QDST  Hamiltonian  in  terms  of  su(2)-variables  and  performing 
an  alternative  analysis  of  our  problem  in  functional  space. 


2.  Quadratic  JH-matrix  algebra  for  the  QDST  dimer 

The  QDST  dimer  is  defined  by  the  following  integrals  of  motion:  the  energy  H  and  the  particle 
number  N  [18], 


ff  =  -^^[(^i*!  ~blb2)^  +  N^]  +  (cot  -  jY)blbt  +  (oji-  jy)blb2  -  eiblbj  +  ^>2^1),  (2.1) 

N  =  blbi+  b\b2.  (2.2) 

where  b]  and  ft,  are  creation  and  annihilation  operators  satisfying  the  boson  commutation  relations 
and  y,  e,  <u,,  i  =  1,2,  being  real  parameters.  The  corresponding  system  represents  a  two  degrees  of 
freedom  integrable  Hamiltonian  system  with  equations  of  motion  given  by 

ft,=  [//,ft,],  ft/=[//,ft/],  /=1,2.  (2.3) 


In  the  classical  limit,  these  equations  were  shown  to  be  exactly  solvable  in  terms  of  elliptic  functions 
{ 18).  In  ref.  [5]  the  QDST  dimer  was  analized  within  the  QISM  on  the  background  of  quadratic  R- 
matrix  algebra  defined  by  the  principal  relation  ( 1.1 )  with  </  =  2.  It  was  found  that  the  /?-matrix  for 
QDST  has  the  simplest  rational  form 


R(u) 


/f(u)  0  0  0  \ 

0  1  ^(M)  0 

0  giu)  1  0 

\  0  0  0  /(«)/ 


(2.4) 


with 


f{u)  =  {u-iy)/u,  giu)  = -iy/u. 

The  /.-operator  of  the  system  is  given  by 

Liu)  -  _  (  Aiu)  Biu)\ 

L{U}-L  L  -yciu)Diu))' 

where  are  the  matrices 

Li"»(u)  =  (  y/^bl,\ 

V  v^ftm  i«  /’ 


m  =  1,2. 


In  this  representation  the  pseudovacuum  vector  |0),  satisfying  the  relations 


(2.5) 


(2.6) 


(2.7) 


(0|5  =  0,  .»|0)  =  0 


(2.8) 
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exists  so  that  the  spectrum  of  the  generating  function  of  the  quantum  integrals  of  motion  (i.e.  the 
trace  of  £(«))  can  be  computed  by  the  algebraic  BA.  Here  we  recall  the  main  ideas  of  this  analysis, 
referring  to  refs.  [5,6]  for  more  details.  In  the  algebraic  BA  a  crucial  role  is  played  by  the  so-called 
Bethe  equations 


a(Uk)  _  A  fjuic  -  Uj) 


k  =  l,...n. 


(2.9) 


with  a{u)  and  d(u)  given  by 

^(M)|0)  =  a(M)|0),  D{u)\0)  =  dium.  (2.10) 

If  this  set  of  equations  is  satisfied,  then  the  eigenfunction  of  the  trace  operator  A(u)  +  D(u)  can  be 
constructed  as 


|'P(u„...,u„))  =  n5(u,)|0),  (2.11) 

J  =  i 


where  n  is  the  number  of  quanta  and  Uj,j  =  1 , . . . ,  n,  are  the  Bethe  parameters.  The  eigenvalue  t(u) 
of  the  trace  operator 


(A(u)  +  D(u))\'P(u,,.--,u„))  =  t{u)\*P{uu...,u„)) 
is  given  by  the  formula 

t(u)  =  a(u)  'h-  -  +d{u)  -1--^  — , 

0;= I  («-«;)  11;= I  («-«>) 

with  u\,...Un  satisfying  the  the  Bethe  equations 

a(Uk)  = -d(Uk)Yl^ — —-L  k  =  l,...,n, 

a{u)  =  [M-i((yi  -  ^y)][w-i(w2-  ^y)],  d{u)  =  -el 
The  Hamiltonian  operator  can  be  written  in  terms  of  the  trace  operator  as 


-e^  -I-  ^(tu,  -  ^y)2  -I-  j(w2-  {y)^ 


u=0 


The  energy  levels  Ek  are  given  by  the  formula 

2 


-  e^  -I-  ^(w,  -  ^y)2  -f  5(W2  -  2?)^- 


J  u=0 


To  find  the  energy  levels  Ek  it  is  convenient  to  rewrite  (2.13)  in  the  form  [4] 


t{u)  =  a{u)  -  + 


^n+\iu) 
11;  =  !  (M  —  My) 


(2.12) 


(2.13) 


(2.14) 

(2.15) 

(2.16) 


(2.17) 


(2.18) 
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where  the  polynomial  (m)  of  degree  n  +  1  equals 


<*^n+i(M)  =  [«-i(wi  -  2y)][M-i(m2-  ^y)]  ^n(w-M;-iy)  - 


-e^  “7 + "  n(““  • 

An  alternate  expression  for  0„+i  (m)  is 

n 

*^n+i(u)  =  {-inyu  +  j3„)  JJ(w  -  u;), 

7=1 


(2.19) 


(2.20) 


where  )S„  is  a  new  parameter.  Equating  coefficients  of  powers  of  u  between  (2.19)  and  (2.20)  leads 
to  the  condition  det{fil  +  P)  =  0  [4],  where  /  is  the  (n  +  1 )  x  (n  +  1 )  unit  matrix  and  P  is  the 
(n  +  1 )  X  {n  +  1 )  lower  Hessenberg  matrix, 


<l0,n 

-iy 

0 

0  ... 

(l\,n 

9l,n-I 

-2iy 

0  ... 

0 

92, « 

92,/i-1 

Ql,n-1 

-3iy  ... 

0 

(2.21) 

-Tiiy 

■ 

(ln,n 

9n,n-l 

9n,n-2 

. . . 

9n,0  / 

with  the  matrix  elements  qm,n-h  =  0,...,n  given  by  the  formula 

9m.n-/  =  -e(m~  n  i  2  )  ^  "  2)  ] 

-id(m-l){(jJx  +  <772-  y)(-iy)'""'''  1 )  H  - ^m,|/-i|^(( -  D] 

+  (-i}')'"“'0(m -/)  [071(772  -  jyicoi  +  072)  +  5}'^+  (-1)'""'£^] 

{n-L) 


(2.22) 


From  the  n  +  1  characteristic  values  Pn'^\k  =  1,...,77  +  1,  one  finally  obtains  the  energy  levels  of 
the  QDST  dimer  as 

Ek  =  7J(07|  +  072)  -  ^71(71  +  2)2y  -  (2.23) 

The  proposed  method  works  effectively  for  the  calculation  of  the  energy  levels  for  small  numbers  of 
quanta;  the  explicit  formulae  are  in  refs.  [5,6]. 
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3.  Linear  r-matrix  algebra  for  the  QDST  dimer 


In  this  section  we  introduce  a  new  Lax  operator  for  the  QDST  dimer  related  to  a  linear  r-matrix 
algebra.  The  existence  of  another  Lax  representantion  is  an  interesting  phenomenon  which  is  related  to 
the  possibility  of  constructing  the  different  bases  in  the  space  of  quantum  states.  In  this  representation 
the  QDST  dimer  becomes  equivalent  to  the  two  site  hyperbolic  quantum  Gaudin  magnet  [8]  and 
the  spectral  problem  for  the  quantum  integrals  of  motion  can  be  solved  by  the  method  of  separation 
of  variables.  In  order  to  construct  a  linear  r-matrix  algebra  representation  for  the  QDST  we  take  as 
space  of  the  quantum  states  the  representation  space  of  the  direct  sum  of  two  su(l,  1)  Lie  algebras 
with  generators  Sa,a  =  1,2  satisfying  the  commutation  relations 

=  -i^afiSjktgimsT,  8  =  diag(l,-l,-l).  (3.1) 

In  the  following,  the  norm  and  the  scalar  product  of  operator  vectors  Sa  will  always  be  computed  with 
respect  to  the  metric  g,  i.e. 


si  =  {Sa,S„)  =  (5^)2  -  (5^)2  -  (5^)2^  (5a,5/,)  =  . 


(3.2) 


The  Casimir  operators  of  the  algebra  (3.1)  have  the  form  =  ka^kc.-  1),  withka  =  1,|,2,5,..., 
for  discrete  series  representation,  while  ka  vary  continuously  from  zero  to  infinity,  0  <  feo  <  oo,  for 
representation  of  the  universal  enveloping  algebra  su(  1, 1 ). 

In  terms  of  these  variables  the  L-operator  of  our  system  is  introduced  as 


L(m)  =  (<r,L(M))  = 


{A(u)  Biu)  \ 
\C{u)  -A{u))' 


(3.3) 


where  Hu)  is  the  operator  vector 


L{u) 


Si  S2 

u  -  a  u~b  2 


and  o  denotes  the  su(  1, 1 )  generators  in  2  x  2  representation 


(3.4) 


It  is  easy  to  check  that  the  L-operator  (3.4)  obeys  the  following  linear  r-matrix  algebra: 
ILHu),L'^{v)]  =  y,k,/,m  =  1,2,3.  (3.5) 


This  expression  can  be  rewritten  in  matrix  form  as 


IL|(m),L2(u)]  =  [r{u-v),Li{u)  +  Lziv)], 


(3.6) 
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with  the  matrix  r(u)  being  proportional  to  the  permutation  matrix  as 


r( 


/I  000\ 
00  10 
0  100 
\0  0  0  1/ 


In  (3.6)  the  notations  for  Li  are  the  same  as  in  ( 1.3). 

By  defining  the  quantum  determinant  of  the  L-operator  as 

detZ-(M)  =  -L}{u)  =  -A^{u)  -  {{B{u)Xiu)}, 


(3.7) 


where  the  brackets  {•,  •}  denote  the  anticommutator,  one  can  construct  the  following  commutative 
algebra: 


[L^{u),L\v)]  =  0. 

We  have  then 

,2,  ,  ^2  Hi  H2  1 

(u-ay  {u-by  u-a  u-b  4 

is  the  generating  function  of  the  quantum  integrals  of  motion 

2  , 

H\  =  -•Si  . 

2  , 

Hi  =  T — -(*1,^2)  -  -si- 
b  —  a 


(3.8) 


(3.9) 


(3.10) 

(3.11) 


The  system  defined  by  these  two  integrals  is  called  the  two  site  hyperbolic  Gaudin  magnet  [  8, 1 4,20  ] . 
In  order  to  make  contact  with  the  QDST  dimer  it  is  convenient  to  construct  the  generators  of  the 
algebra  (3.1 )  in  terms  of  new  operators  as 


i  (Pa  +  4  =  3(Pa  -  -^a  = 

with  Pa,Xa  satisfying  Heisenberg’s  commutation  relations 

[Pa,Xp]  =  -iS„g,  [Pa,Pfi]  =  [Xa,Xg]  =  0,  a,  =  1,2. 


(3.12) 


(3.13) 


One  easily  checks  that  for  such  a  realization  of  the  algebra  (3.1)  the  Casimir  operators  have  the  value 
=1,2  (we  deal  with  the  metaplectic  representation  of  the  su(l,  I)  algebra,  ka  =  5,  f  )• 
Performing  the  canonical  transformation 


P\  =  \{b\  +  bi  -tr  b\  bi),  x\  =  (6|  +  6]  -  61  -  62)/2i, 


Pi  =  (-bj  +  b^  +  bi-  b2)l\,  X2  =  {{b\  -  b\ bi  -  62), 


(3.14) 


one  obtains 
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Hi  +  H2  =  —^{b\b\  +  ^2^2  +1). 

2(Hi-H2)=  ((b\bi-b\b2)^  I  {a -b)-  {blb2  +  b\bi).  (3.15) 

By  comparing  these  expressions  with  (2.2),  (2.1 )  we  see  that  the  QDST  system  is  equivalent  to  the 
two  site  Gaudin  magnet  if  coi  =  (02  and  a-b  =  From  (3.3),  using  relations  (3.12),  (3.14) 

one  gets  the  L-operator  of  the  QDST  dimer  satisfying  the  linear  r-matrix  algebra  (3.6). 


4.  The  method  of  separation  of  variables  for  the  QDST  dimer 

The  Lax  operator  (3.3)  introduced  in  the  previous  section  does  not  admit  the  highest  vector  |0)  for 
which 


B(um  =  0,  (0|B(u)  =  0.  (4.1) 

This  is  due  to  the  presence  in  (3.4)  of  the  c-number  term  jCoULO.O).  In  the  method  of  separation 
of  variables  eqs.  (4.1 )  are  replaced  by  the  operator  equation 

B(u)  =  0  (4.2) 


for  the  operator  zeros  uj  of  B.  If  we  substitute  (by  the  left)  u  =  Uj  in  (3.3),  the  L-operator  acquires 
the  form 


Lj 


L{Uj)  = 


(A{Uj)  0  ] 

V  *  -A(Uj)J 


(4.3) 


from  which  we  see  that  ivj  =  A (uj  ),j  =  1 are  the  “operator  eigenvalues”  of  L ( m ) .  We  aim  at 
solving  the  spectral  problem  for  the  generating  function  of  the  integrals  of  motion,  which  in  this  case 
is  given  by  the  operator  determinant 

-L^(u)  =  detL(u).  (4.4) 


Since  the  operation  of  taking  the  determinant  of  L(m)  commutes  with  the  substitution  u  Uj  we  can 
express  (4.4)  in  Uj,Vj  variables  as 

l}(Uj)  ~ -v],  (4.5) 

By  using  the  linear  r-matrix  algebra  for  the  L-operator,  one  can  prove  the  following  commutators: 

\Vj,Uk]  =  -iSjk,  [Ui,Uj]  =  [Vi,Vj]  =  0,  (4.6) 

i.e.  the  uj  and  vj  are  canonical  operators.  This  fact  allows  to  separate  the  eigenvalue  problem  for  L^{u) 
L^(u)'P  =  t{u)^  (4.7) 


into  a  set  of  one  dimensional  problems 

-vfy/iiUi)  =  t{Ui)y/{Ui), 


(4.8) 
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with 

I 

In  the  following  we  show  how  this  works  for  the  QDST  dimer.In  this  case  the  spectral  problem  has 
the  form  (see  (3.9)) 


L^{uW  =  t(uW  Hi'P  =  hi'P, 
t(u)  =  - 


3 

T? 


3 

T5 


h\  1 

(M-a)2  (u-b)^  ^  u-a  ^  u-b  ^  4‘ 


The  separation  variables  are  determined  as  the  roots  of  the  equation 
Biu)  =  -LHu)  +  L^iu)  =  0. 

In  explicit  form  we  have 

jf  2  y2 


u-a  u-b 
with  the  roots  Mi,  U2  given  by  the  equations 

ui  +  U2  =  c  +  b  +  Xi  +  xj,  u\U2  —  ab  +  bx\  +  axj. 
Let  us  introduce  the  following  notation: 


e{u)  = 


1  =  - 


(m-Mi)(m-M2) 


u-a  u-b  ~  {u-a){u-b) 

Taking  the  residues  in  u  =  a  and  u  =  b  at  both  sides  of  (4.13),  we  have 
iui-a){u2-a)  2  {ui-b){u2-b) 


- 

X\  — 


b-a  ’  -^2  a- b 

These  elliptic  coordinates  U|,  1/2  in  satisfy  the  inequalities 


(4.10) 


(4.11) 


(4.12) 


(4.13) 


(4.14) 


a  <  u\  <  b  <  U2  <  00.  (4.15) 

For  each  uj  we  define  the  additional  variable  vj  as  (the  left  substitution!) 

vj  =  -lAiUj)  =  L\uj)  =  j~~.{xuPi}  +  (4.16) 

The  matrix  elements  of  the  L-operator  (3.3)  in  m„v,  variables  then  have  the  form 
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B{u)  =  ~ 
A(u)='^ 


■S}  -  5?  _  5j  ~sl  _  1  (m-U|)(M-M2) 
u-a  u-b  ~2  {u-a)iu-b)  ’ 


{P\,Xi}  ^  i  {P2,X2} 
u-a  ^  4  u-b 


=  2iB{u) 


1 


U-U\ 


D\Vx 


)■ 


(4.17) 

(4.18) 


where 


„  {u\-a)(ux-b)  ^  (u2-a)iu2-b) 

U\  —  - ,  U2  —  - . 

Ml  -U2  U2-  Ml 

By  using  (4.19),  (4.13)  it  is  easy  to  show  that 

-1 


D.=  (-r^ 

\iui-a 


)2  ^  (Ui-by 


j  >0,  1  =  1,; 


(4.19) 


(4.20) 


Note  that  the  meromorphic  operator- valued  function  /4(m)  is  obtained  by  interpolation  with  the  data 
A(uj)  =  ivj  and  the  expression  for  B(m)  follows  from  the  definition  of  the  variables  Uj.  By  equating 
the  residues  at  m  =  a,  6  in  the  r.h.s  and  l.h.s  of  (4.18)  we  obtain 


3{Pa,JCa}  =  -xl  (  --  ■-D2V2  +  ) . 

\^a  -  M2  Ca  -  Ml  J 

where  <*1  =  a  and  ez  =  b. 

The  conjugation  properties  for  the  introduced  operators  have  the  form: 


(4.21) 


M*  =  My,  DjVj  =  v]Dj  (4.22) 

from  which  we  see  that  the  operators  vj  are  not  self-adjoint.  From  the  Vj  one  can  construct  self-adjoint 
operators  Wj  as  follows: 


Wj  =  y/WjVj 


Vj  = 


Wj^/D], 


(4.23) 


and  from  (4.23),  (4.6),  (4.20)  one  can  show  that  the  operators  wj,  Uj  are  also  canonical  ones. 
Substituting  u  =  Uj  into  (3.9)  and  taking  into  the  account  (3.7)  we  obtain  two  operator  equalities 


3  3 

_ ^,2  _ _ 1?  _ _ T? _  _ 

■'  (My  -  fl)2  (My  -  b)^  ^  My  -  M 


(4.24) 


Acting  by  the  right  and  left  hand  sides  of  this  operator  equality  on  !P,  which  is  a  common  eigenfunction 
of  Hi  and  H2,  we  obtain 


vj'f' +  t(uj)^  =  0,  j=l,2,  (4.25) 

where  the  function  t{u)  is  given  in  (4.10).  In  terms  of  wj  the  above  equations  take  the  form 

wj^j^ +  t(uj)^j^  =  0.  (4.26) 


148 


V.Z.Enot’skii  et  a/,  t  The  quantum  inverse  scattering  method 


Let  us  demand  that  the  wave  function  factorizes  as 

=  ^^^exp[-^(ui  +  U2)]<^,  <t>  =  <i>\(ux)(t>2(u2),  (4.27) 

where  the  partial  functions  (ftjiuj)  are  defined  on  the  corresponding  intervals  (4. IS).  Note  that  the 
factor  \/\/DiD2  in  (4.27)  is  connected  to  the  Jacobian  of  the  transformation  from  the  xj  to  the  Uj 
variables, 

|deta(x,,X2)/a(u,,U2)|  =  (4.28) 

By  substituting  (4.27)  in  (4.26)  we  obtain  the  following  separation  equations: 

-<A"(m)  +  (  1  -  - i-r  )  +  -^)  4>{u)  =  0.  (4.29) 

I  u-a  u-bj  \u-a  u-bj 

Here  (and  below)  we  have  omitted  the  indices  since  the  (^’s  satisfy  equations  of  the  same  form,  but  on 
different  intervals.  Equation  (4.29)  is  of  confluent  Heun’s  type  (see,  e.g.  refs.  [1,12] ),  having  a  and 
b  as  regular  singularities  and  oo  as  irregular  singularity  and  with  the  spectral  parameters  hj,j  =1,2, 
given  by 

hi  =  hi  +  \  -\/i(a-b),  A2  =  /!2  +  |-l/8(6-a).  (4.30) 

It  is  worth  to  remark  that  although  the  (ft's  satisfy  the  same  equation  they  are  defined  on  different 
intervals.  In  order  to  get  the  eigenvalues  we  have  to  solve  two  boundary  problems,  one  for  the  angular 
interval  a  <  ui  <  b  and  the  other  for  the  radial  interval  b  <  U2  <  oo.  Furthermore,  both  'F  and 
exp [2 (“Ml  -  M2)] are  demanded  to  be  square  integrable  in  their  variables. 

Let  us  take  the  following  form  for  the  solution  of  eq.  (4.29): 


(j){u)  =  (M-a)*'(M-6)*»d>(M),  0{u)  = 


(4.31) 


j=i 


where  ki  =  0,  These  partial  functions  are  polynomials  multiplied  in  some  cases  by  the  square  root 


of  M  -  a  or/and  u-b.By  substituting  (4.31 )  in  (4.29)  we  get  the  following  equation  for  0: 

-0"+(t.^h±i.?h+A] 0'+  (" Ju  +  =  0, 

\  U-a  u-b  j  \u-a  u-bJ 


(4.32) 


where 


L  _C  .  /,  1  fc,  + /c2  +  4fc|fc2  c  c  .  I.  .  Iki+k2  +  4kik2 

bi  —  hi  +  Ki  —  x - - — r - ,  «2  =  "2  +  A:2  +  x - r - .  (4.33) 

2  a  —  b  2  a  —  b 

Equation  (4.32)  has  just  polynomial  solutions  with  the  zeros  A;  satisfying  the  nonlinear  equations 


1  .  ki  +  \  ki  +  \ 


+ 


+ 


2  X,-b 


=  0, 


(4.34) 
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It  is  possible  to  prove  that  the  zeros  A/  have  the  following  properties  (generalized  Stieltjes  theorem 
[22]):  (i)  they  are  simple,  (ii)  they  are  placed  along  the  real  axis  inside  the  intervals  (4.1S),  (iii) 
they  are  the  critical  points  of  the  function  jG], 

G(A,,. . =exp[-^(2,  +  .-  +2„)]  +  (4.35) 

p=l  r>p 

To  calculate  the  eigenvalues  hi  we  must  first  solve  eqs.  (4.34)  and  then  use  the  formulae 
Hi  =  (2ki  +  a  Z  f  ’  ^2  =  (2/c2  +  11  r> 

7=1  ^  7=1  ^ 

hi  -h  h2  =  —n  =  hj  +  /22  +  fe|  +  )c2  +  j.  (4.36) 

From  (4.36)  we  see  that  the  spectrum  of  Hi  +  has  the  form 

{Hy  +  H2)  =  h\  +  h2  =  — +  ^2^2)  ~  5  =  ~(w  +  ^^1  +  ^^2  +  5)  (4.37) 

and  therefore  the  spectrum  of  N  is 

{N)  =  2(n  +  ki+  ki).  (4.38) 

For  each  fixed  n,  ki  and  ^2  (ie.  for  each  fixed  {N))  we  have  n  +  1  different  eigenfunctions 
0  (u,  )  which  differ  by  the  distribution  of  their  zeros  among  two  intervals:  (a,b)  and  (b,oo). 

The  fcth  polynomial  has  k  zeros  on  the  first  interval  and  n  ~  k  zeros  on  the  second  (k  =  0, . . . , n) 
and  for  fixed  (N)  we  have  (N)  +  I  eigenfunctions. 

Let  us  take  the  alternate  representation  for  the  polynomial  0{u) 

n 

0(u)  =  Y^cjiu- aV,  c„  =  1.  (4.39) 

2=0 

Equivalently  one  can  take  the  expansion  near  another  singular  point  u  —  6.  If  we  denote  u  -  ’  =  z, 

<%  <«  <• 

a  -  b  =  c,  h  =  hi  +  h2  =  -n,  g  =  ehy,  we  have 

-z(z  +  e)<^"(z)  +  [z^  +  z(£  -  2ki  -  2^2  -  1)  -  (2ki  +  ^)e]0'i2)  +  (zh  +  g)0{z)  =  0, 
which  gives  the  following  three-term  recurrent  relations  for  the  cj 

Cj[g-  jU  +  2ki  +2k2-i)]  +  Cj-iU  -  1  -  «)  -c;+i£(;  -l-  1)(;  +  2A:i  +  ^)  =  0, 
j  =  Q,...,n  (4.40) 

with  c_,  =  c„+i  =0.  This  equation  can  be  rewritten  in  the  form 


Ac  =  0, 


(4.41) 
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with  A  being  a  three-diagonal  matrix  which,  after  symmetrization,  reduces  to  a  Jacobi  matrix.  The 
procedure  of  finding  the  «  +  1  eigenvalues  turns  to  the  problem  of  calculating  of  zeros  of  the  charac¬ 
teristic  polynomial  det  /I  =  0.  We  remark  here  that  the  above  prosedure  of  passing  form  the  solutions 
kj  of  the  (4.34)  to  the  polynomial  (4.39)  coefficients  is  analogous  to  the  described  in  the  section  2 
derivation  of  the  energy  levels  on  the  background  of  Bethe  ansatz  for  the  quadratic  /?-matrix  algebra. 

It  is  easy  to  get  the  expressions  of  in  terms  of  the  initial  variables  x,  .  Let  us  dispatch  the  factor 
l/v^/)iZ)2  to  the  weight  when  integrating  upon  the  uj  variables,  so  in  xj  variables  the  wave  function 
has  the  form 


*f'(xi,X2)  =  exp[-^(xf x^)]xf*'x2*^  JJ®(A,),  (4.42) 

i=l 

where  S(ki),  i  =  is  defined  in  (4.13)  and  satisfy  eqs.  (4.34).  One  can  verify  that 

is  an  eigenfunction  of  the  two  commuting  operators  and  Hi  labelled  by  the  quantum  numbers 
n,ki  and  ki  and  by  an  additional  index  enumerating  the  n  -i-  1  different  sets  of  solutions  for  {Ay}"_,. 
If  xi  =  0,  we  have  that  (4.42)  reduces  to  the  usual  Hermite  polynomials,  so  our  functions  are 
generalized  Hermite  plynomials  or  confluent  Heun’s  polynomials  [1,12,17,24).  Heun’s  polynomials 
were  studied  in  ref.  [17]  and  the  confluent  Heun’s  polynomials  first  appeared  in  connection  with  the 
two-dimensional  Coulomb  problem  [12]  and  is  remarkable  that  they  have  also  appear  in  connection 
with  the  eigenfunctions  of  the  QDST  dimer. 


S.  sa(2)  algebra  and  QDST  dimer 

In  this  section  we  show  that  the  QDST  dimer  is  naturally  related  to  the  su(2)  algebra.  We  use  this 
fact  to  check  the  results  of  the  previous  section.  To  this  end,  let  us  introduce  the  operators 


h  =  j{b\bi  -  blbi),  t+ =  ti  +  ill  =  b\b2,  t-  =  t]  -  iti  =  b^bi,  (5.1) 

with  the  operators  6,, satisfying  the  commutation  relations  [^i, ft]]  =  Stj  [ft,,fty]  =  [ft], ft]]  =  0. 
The  operators  (5.1)  are  the  generators  of  the  su(2)  algebra 

[/3,/±]  =  ±/±,  (r+,r-]  =  2/3.  (5.2) 

The  Casimir  operator  is  given  by 

Cl  —  —  sis  1),  (5.3) 

where  5  is  the  eigenvalue  of  the  operator  ^  (ft]fti  +  b^bi ).  In  terms  of  the  generators  (5.2)  the  Hamil¬ 
tonian  of  the  QDST  dimer  becomes 

H  =  -yt\  A-  Aty-leti  -siys  -I-  y  -  (wi  -i-  coz)),  (5.4) 

where  A  =  to  i  -  (Oz  and  the  average  of  the  particle  number  operator  equals 

(N)  =  25.  (5.5) 


If  we  take  the  unitary  irreducible  representation  of  the  su(2)  algebra  in  the  vector  space 
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f3l5,m)  =  m|5,m). 

(5.6) 

f±|5,m)  =  ^/{sT^n){s±m  +  l)|5,m  ±  1), 

(5.7) 

where 

5  =0,  j,l,...  =  jW, 

(5.8) 

then  the  spectrum  of  H  is  simply  obtained  by  diagonalizing  a  three  diagonal  matrix.  These  matrices 
were  written  down  explicitly  in  a  number  of  cases  by  Scott  [18]. 

Here  we  show  that  the  eigenfunctions  V  oi  H  and  N  are  the  confluent  Heun’s  polynomials,  in 
agreement  with  the  result  of  the  previous  section.  To  this  end  we  consider  the  following  realization  of 
the  su(2)  algebra  in  the  space  of  holomorphic  functions: 

=  xdx  -s,  /_  =  dx,  /+  =  -x^dx  +  2sx.  (5.9) 


The  operators  /+  and  /_  are  adjoint  to  each  other  in  the  scalar  product  (x  €  C), 


2s+i  f  (p(x)y/(x)  2„ 


(5.10) 


while  is  a  self-adjoint  operator.  The  basis  vectors  \s,  m)  in  this  representation  have  the  form  /«  = 
CinX^"’’'”  with  Cm  =  [(25)!/(5  -  m)!(5 -f  m)!]‘/^. 

It  is  convenient  to  realize  the  algebra  (5.9)  inthespaceoffunctionsx“^lF'(x)  withx  =  z  + v'z^  -  1. 
Then  the  generators  are 


53  =  \/r2  -  1  di,  s±  =  iz  ±  -  i){s  T  ~  i  dz). 


(5.11) 


In  terms  of  z  the  spectral  problem  H0(z)  =  h0{z)  becomes 


0"  + 


0' 


+ 


Z  +  5  +  1  - 


0)l  +  (02 

y 


(5.12) 


This  equation,  in  the  case  J  =  0,  coincides  with  eq.  (4.29)  with  a  =  2ely,b  =  -2c/}'. 

We  finally  remark  that  eq.  ( 5. 1 2 ),  rewritten  in  the  y  =  In  x  variables,  gives  the  quasi-exactly  solvable 
system  considered  in  ref.  [9]. 


6.  Conclusions 

We  have  shown  that  two  different  representations  for  the  wave  functions  of  the  Mmer  can 

be  constructed.  The  first  one  is  connected  with  the  quadratic  /?-matrix  algebra  and  t»i.,  second  one 
with  the  linear  r-matrix  algebra.  These  two  representations  must  be  connected  by  some  unitary  trans¬ 
formation.  It  is  of  interest  to  find  this  transformation  explicitly,  since  it  would  elucidate  the  link  be¬ 
tween  the  standard  algebraic  BA  and  Baxter  equation  (see  [21  ] )  and  the  polynomial  solutions  of  the 
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separation  equations.  This  question  is  presentely  under  investigation.  A  further  interesting  problem 
under  investigation  is  the  application  of  the  method  of  separation  of  variables  to  ^-deformations  of 
the  QDST  dimer.  This  would  give  the  possibility  of  introducing  a  new  class  of  special  functions  i.e. 
the  ^'-deformed  Heun’s  polynomials. 
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An  evolution  equation  is  presented  to  describe  the  transport  of  a  quantum  mechanical  quasiparticle  such  as  an  electron, 
or  electronic  or  vibrational  excitation,  interacting  strongly  with  lattice  vibrations.  A  generalization  of  the  discrete  nonlinear 
Schrodinger  equation  to  incorporate  dissipation  and  fluctuation  effects  arising  from  interactions  with  a  thermal  reservoir, 
the  equation  predicts  a  multitude  of  interesting  phenomena  including  bifurcations. 


1.  Introduction 

Alwyn  Scott  has  initiated,  performed,  stimu¬ 
lated,  and  guided,  an  amazing  amount  of  current 
research  in  nonlinear  physics.  The  community  of 
nonlinear  scientists  owes  much  to  him.  It  is  a 
pleasure  and  an  honour  to  be  asked  to  make  this 
contribution  to  the  Festschrift  on  the  occasion  of 
his  sixtieth  birthday.  The  range  of  Alwyn’s  work 
in  nonlinear  science  is  vast.  It  spans  a  variety  of 
systems  and  approaches,  and  deals  with  practical 
matters  such  as  the  problem  of  launching  solitons 
and  the  spectra  of  acetanilide,  as  well  as  formal 
matters  such  as  the  dynamics  of  nonlinear 
systems  and  the  generalization  of  nonlinearities 
[1-4]. 

The  subject  of  the  present  article  is  the 
ecumenical  nonlinear  von  Neumann  equation 

Plmn  X(.Pmm  Pnn^Pmn 

-  '  f  Pmni{y^  PLm  “  [K  p]„„) 

-  Mi  -  s„„)ip„„  -  pZ)  ■  (1) 

Before  describing  the  what,  whence,  and  whither 
of  this  equation,  it  is  relevant  to  comment  on  the 


relation  of  the  equation  to  Alwyn  Scott.  There 
exists  a  nonlinear  transport  equation  which  goes 
under  the  name  of  the  discrete  nonlinear  Schrod- 
inger  equation  (DNLSE),  whose  study  has  oc¬ 
cupied  many  investigators,  including  the  present 
author  [5-8],  for  several  years.  Equation  (1)  is 
the  augmented  form  taken  on  by  the  DNSLE, 
when  the  latter  is  opened  to  heat  reservoirs  and 
prepared  for  stochastic  activities  through  the 
introduction  of  dissipation  and  fluctuation.  The 
DNLSE,  on  the  other  hand,  is  identical  in  form 
to  the  so-called  discrete  self-trapping  equation 
(DSTE),  introduced  earlier,  and  studied  exten¬ 
sively,  by  Alwyn  Scott  and  his  collaborators. 

While  identical  in  form,  and  therefore  in  the 
relevance  of  a  number  of  results  such  as  con¬ 
cerning  stationary  states,  the  DSTE  and  the 
DNLSE  differ  significantly  in  physical  meaning. 
This  is  evident  from  the  role  played  by  quantum 
mechanics  in  the  two  equations.  The  complex 
nature  of  the  amplitudes  in  the  DNLSE  are  a 
requirement  of  quantum  mechanics  and  the 
study  of  the  interplay  of  phases  and  nonlinearity 
is  an  undertaking  of  direct  physical  import.  The 
complex  nature  of  the  amplitudes  in  the  DSTE, 
on  the  other  hand,  is  a  convenience  of  the  type 
familiar  in  electromagnetism  where  fields,  which 
are  actually  real,  are  described  by  complex 
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quantities  for  computational  ease.  There  is,  thus, 
work  on  the  “quantization”  of  the  DSTE,  which 
appears  to  have  no  counterpart  at  all  in  the 
context  of  the  DNLSE.  Also,  the  two  equations 
differ  in  physical  origin.  We  begin  with  a  brief 
motivational  discussion  of  the  microscopic  origin 
of  the  DNLSE  in  the  next  section.  After  a  quick 
review  of  earlier  work  on  the  DNLSE  in  the 
following  section,  we  introduce  noninhnite  relax¬ 
ation  and  thermal  fluctuations,  and  arrive  at  the 
ecumenical  equation  (1),  which  has  been  so 
termed  because  it  is  capable  of  resolving  the 
competition  and  conflicts  of  nonlinearity  and 
damping,  or  of  fluctuation  and  dissipation,  since 
it  unifies  the  description  of  all  these  effects  in 
one  fell  swoop.  The  fascinating  consequences  of 
the  augmented  equation  will  occupy  us  in  the 
rest  of  the  article.  The  subject  under  investiga¬ 
tion  is  the  transport  of  a  low  or  intermediate 
mobility  quasiparticle  under  strong  interactions 
with  vibrations.  Examples  are  an  electron  in  a 
narrow-band  material,  a  vibrational  or  electronic 
excitation  in  a  polymer,  and  a  light  interstitial 
such  as  a  proton  or  muon  in  a  metal.  The 
notation  in  (1)  is  standard:  p  represents  the 
density  matrix  of  the  quasiparticle,  the  matrix 
elements  are  taken  in  the  representation  of  some 
localized  states  such  as  a  Wannier  set,  and  the 
four  terms  on  the  right  side  arise  from  intersite 
transfer,  nonlinearity  stemming  from  interactions 
with  vibrations,  finiteness  of  vibrational  relaxa¬ 
tion,  and  thermal  fluctuations,  respectively. 

2.  Physical  motivation  of  the  DNLSE 

Consider  a  moving  quasiparticle  described  by 
the  ket  |'#'(r))  whose  time  evolution  is  governed 
by  the  Hamiltonian  H  through  ift(d|’P(r)))/d/ = 
//|^(/)),  i.e.,  the  standard  Schrodinger  equa¬ 
tion.  On  multiplying  the  equation  by  a  Wannier- 
like  localized  bra  (m|,  one  obtains 


for  the  amplitudes  c„  where  the  E’s  and  the  V's 
are,  respectively,  the  diagonal  and  off-diagonal 
matrix  elements  of  the  Hamiltonian  in  the  local¬ 
ized  basis.  If  interactions  with  vibrations  are 
present,  the  V’s  and  E’s  are  dependent  on  the 
vibrational  coordinates  x.  For  simplicity,  let  us 
take  to  be  independent  of  x,  and  E„  to 
depend  linearly  only  on  the  internal  coordinate 
x„  at  site  m.  In  the  absence  of  interactions,  x„ 
might  have  evolved  sinusoidally  with  frequency 
o)  and  equilibrium  position  0.  It  might  naturally 
obey,  in  the  presence  of  interactions, 

d^jt 

+  = -const.  x|c„p,  (3) 

where  the  equilibrium  position  of  the  oscillator  is 
changed  by  an  amount  proportional  to  the  prob¬ 
ability  that  site  m  is  occupied  by  the  quasi- 
particle.  In  the  presence  of  a  time  scale  disparity, 
if  the  vibrations  are  slaved  by  the  quasiparticle 
probabilities,  the  time  derivatives  in  (3)  may  be 
put  equal  to  zero.  The  vibrational  coordinate  is 
then  proportional  to  the  quasiparticle  occupation 
probability,  and  we  obtain  the  DNLSE 

=  2  -  AT  |c„.  I  .  (4) 

The  above  description  of  the  origin  of  the 
DNLSE  is  meant  to  convey  only  its  essential 
physical  content.  While  we  do  not  wish  to  discuss 
here  the  many  subtleties  which  the  microscopic 
derivation  of  the  DNLSE  entails  [9,10],  we  now 
give  a  brief  description  of  how  the  DNLSE  can 
be  obtained  through  a  semiclassical  approxima¬ 
tion  from  the  Hamiltonian  evolution  of  the 
standard  model  of  a  quasiparticle  interacting 
strongly  with  vibrations,  viz.. 


H  -  ^  ^  V„  „al,a„ 

““  m  m  m  ••  m.n  m  n 

m  m,n 

X  exp(i9  •  RJ  (6,  -I-  bl^)ala, 


(2) 


(5) 
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Equation  (5)  uses  standard  notation  which  we 
will  not  detail  here,  except  for  stating  that  a  and 
b  refer,  respectively,  to  the  quasiparticle  and  the 
vibrations  with  which  it  interacts  strongly.  With 


-«2«2  . 

(6a) 

-i(a|a2  -aja,) , 

(6b) 

(6c) 

-{b^  +  b)l2g. 

{6d) 

the  Hamiltonian  (S)  can  be  written,  for  a  simple 
two-site  symmetric  system  (a  dimer)  interacting 
with  a  single  vibrational  coordinate,  as 

H  =  Vf  +  guip{b^  +  b)  +  a)ib^b  +  \).  (7) 

Here  V  denotes  the  interaction  matrix  element 
for  the  quasiparticle  transfer  between  the  two 
sites  of  the  dimer,  g  measures  the  coupling 
strength  with  the  vibration  of  frequency  to,  and 
we  have  put  the  (identical)  site  energy  of  the  two 
sites  equal  to  zero.'  Henceforth  we  put  ft=l. 
Equation  (7)  then  results  in  the  following  evolu¬ 
tion  for  the  quantities  defined  in  (6): 


^  =  (8a) 

=  -2Vp  -  xfy  ,  (8b) 

dr 

^  =  (8c) 

d^y  2 

(8d) 


Let  us  now  take  the  expectation  values  of  the 
operators  in  an  initial  state  tit,  and  denote  these 
values  by  removing  the  circumflexes,  i.e.,  I  = 
The  single  assumption  that  y  is  classical 
converts  the  exact  dynamics  into  the  DNLSE,  as 
it  is  trivial  to  show  [6-8]  that  eqs.  (8)  rewritten 
without  the  circumflexes  represent  the  DNLSE 
for  the  two-site  system,  the  p,  q  and  r  being 
nothing  other  than  the  standard  Feynman  den¬ 


sity  matrix  element  combinations  of  the  two-site 
system  (dimer); 

P  ~  Pii~  P22  ’  9  ~  KPi2  “  P21) » 

r^Pi2  +  P21  ■  (^) 

While  simple,  the  foregoing  demonstration 
makes  transparent  what  crucial  assumption  is 
responsible  for  the  passage  from  microscopies  to 
evolution  equations  such  as  the  DNLSE.  It  is 
clearly  the  assumption  that  the  vibrations  may  be 
considered  classical.  This  fact,  while  well  known 
to  many,  has  often  gone  unappreciated.  This  has 
led  to  numerous  incorrect,  although  wishful, 
assertions  on  the  one  hand  and  meaningless 
queries  on  the  other. 


3.  Nonlinear  capture  and  trimer,  N-mer 
evolution 

Before  proceeding  with  the  process  of  the 
augmentation  of  the  DNLSE  into  the  ecumenical 
form  (1),  '*  is  of  interest  to  recall  the  large 
number  01  interesting  and  useful  results  which 
have  emerged  from  the  DNLSE  in  a  variety  of 
situations.  Space  limitations  force  us  to  do  no 
more  than  list  the  primary  contexts  of  some  of 
that  work,  along  with  some  slightly  greater  detail 
on  two  specific  areas,  viz.  nonlinear  trapping 
[11,12]  and  exact  analysis  of  some  spatially 
extended  systems  [13,14].  For  further  descrip¬ 
tion,  the  reader  may  refer  to  several  more 
detailed  reviews  [6-8]. 

The  following  is  a  list  of  some  early  work  done 
on  the  DNLSE,  i.e.,  from  the  infinite  relaxation, 
zero  temperature  limit  of  (1): 

(1)  Exact  solutions  for  the  dynamics  of  the 
two-state  system  for  arbitrary  initial  conditions 
and  the  elucidation  of  polaronic  motion  and  self¬ 
trapping  on  the  basis  of  those  solutions  [5]. 

(2)  Application  to  fluorescence  depolariza¬ 
tion,  wherein  the  moving  quasiparticle  is  an 
electronic  excitation  and  the  observable  is  the 
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intensity  of  light  emitted  with  different  polariza¬ 
tions  [6,15]. 

(3)  Application  to  neutron  scattering  of  hy¬ 
drogen  trapped  around  impurities  such  as  ox  gen 
in  metals  such  as  niobium,  where  the  moving 
quasiparticle  is  the  proton  [16]. 

(4)  Application  to  muon  spin  relaxation  in 
antiferromagnetic  solids  such  as  bcc  iron  wherein 
the  quasiparticle  is  a  muon  moving  within  a 
solid,  and  the  observable  is  spin  polarization 

[7.17] . 

(5)  Generalization  of  the  DNLSE  to  incorpo¬ 
rate  anharmonic  potentials  and  nonlinear  restor¬ 
ing  forces,  resulting  in  nonlinearities  other  than 
bilinear,  and  the  appearance,  and  counterintui¬ 
tive  disappearance,  of  multiple  stationary  states 

[6.18] . 

(6)  Calculation  of  memories  in  nonlinear 
generalized  master  equations,  and  the  develop¬ 
ment  of  a  perturbative  scheme  which  is  exact  in 
the  nonlinearity  but  perturbative  in  the  intersite 
transfer  [19j. 

(7)  Studies  of  the  interplay  of  nonlinearity 
and  disorder  on  the  basis  of  ensembles  with 
various  distributions  of  the  nonlinearity  parame¬ 
ter  [6,7]. 

(8)  Theory  of  nonlinear  trapping  of  excitation 
directed  at  fluorescence  quenching  in  molecular 
aggregates  [11,12]. 

(9)  Analytic  solutions  for  a  restricted  class  of 
initial  conditions  in  some  spatially  extended 
systems  such  as  trimers  and  symmetric  A-mers 
[13,14]. 

We  refer  briefly  to  the  last  two  of  these  below. 

3. 1 .  Nonlinear  trapping 

Quasiparticle  trapping  is  an  important  phe¬ 
nomenon,  and  is  of  particular  interest  in  areas  of 
investigation  such  as  photosynthesis,  in  which 
the  harvesting  of  energy  necessary  for  the  oper¬ 
ation  of  the  reaction  centers  is  followed  by  the 
process  of  the  transfer  of  the  harvested  energy  to 
the  reaction  center.  The  excitation  carrying 


energy,  but  no  matter,  is  the  quasiparticle,  and 
the  reaction  center  is  the  trap  [20,21].  Let  us 
assume  that  an  excitation  moves  on  a  chain  via 
nearest  neighbour  interactions  V,  and  trapped  by 
a  site  which  has  the  nonlinear  behavior  arising 
from  strong  interactions  with  vibrations  leading 
to  the  nonlinearity  described  by  the  cubic  term  in 
the  DNLSE.  Of  the  several  possible  models  of 
capture,  consider  two:  one  in  which  one  of  the 
sites  in  the  chain  is  itself  the  trap  site  and 
possesses  the  cubic  nonlinearity,  and  another  in 
which  the  excitation  moves  in  a  region  of  space 
called  the  antenna  and  communicates  with  a  trap 
which  is  external  to  the  antenna.  In  a  simple 
example  of  the  latter  situation,  the  N  antenna 
sites  all  communicate  equally  with  the  external 
trap  through  a  matrix  element  W,  while  transfer¬ 
ring  excitation  among  themselves  through  near¬ 
est  neighbour  elements  V.  The  first  model  is 
represented  by 

(10) 

in  obvious  notation,  while  the  second  is  repre¬ 
sented  by 

i-^  =  nc„.,+c„..,)  +  Wc,,  (11a) 

=  (11b) 

fn 

Numerical  calculations  by  Dunlap,  Kenkre  and 
Reineker  [11]  on  the  first  model  have  shown  that 
a  transition  appears  to  occur  as  the  nonlinearity 
parameter  x  crosses  the  value  (3.2)V.  The  sec¬ 
ond  model  has  been  solved  analytically  by  Ken¬ 
kre  and  Kus  [12]  who  have  shown  that  the 
probability  of  the  trap  site,  which  we  will  call  P, 
obeys 

-3^T)P^-2eP\  (12) 

where,  with  the  notation  that  w  is  defined  as 

VA  W,  the  scaled  time  t  and  the  parameters  ^ 
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and  17  are  t  =  2wt,  f  2nd  17  =  Ww,  the  /’s 

being  constants  of  the  motion.  Clearly,  ^  mea¬ 
sures  the  amount  of  nonlinearity  relative  to  the 
(scaled)  interaction  matrix  element  for  transfer 
of  excitation  between  the  antenna  and  the  trap 
while  Tj  measures  the  motion  of  the  excitation 
within  the  antenna  relative  to  the  extent  of  its 
transfer  from  the  antenna  to  the  trap.  Equation 
(12)  can  be  solved  in  terms  of  Weierstrassian 
elliptic  functions.  A  completely  general  solution 
to  the  problem  has  been  obtained  for  arbitrary 
initial  conditions,  and  a  number  of  physical 
features  have  been  extracted  [12).  The  analysis 
has  also  been  generalized  to  finite  relaxation  and 
finite  temperature  situations. 

3.2.  Spatially  extended  systems:  analytic  results 

The  methods  of  analysis  used  in  the  early  work 
[5]  to  obtain  exact  solutions  in  two-site  systems 
were  generalized  by  Andersen  and  Kenkre  to 
some  larger  systems  including  trimers  and  a  class 
of  extended  systems  termed  N-mers  [13,14]. 
They  obtained  the  explicit  time  dependence  for 
the  trimer  for  a  class  of  special  initial  conditions, 
discussed  a  transition  at  as  well  as  the 

effect  of  the  sign  of  the  nonlinearity,  and  showed 
the  connection  of  their  results  to  the  trimer 
stationary  states  obtained  earlier  by  Eilbeck  et 
al.  [3].  They  generalized  the  work  to  A-mers, 
also  obtaining  explicit  analytical  solutions.  The 
systems  considered  were  site-degenerate  with 
=  V  between  any  two  sites,  the  initial  excita¬ 
tion  being  localized  on  a  single  site,  or  more 
generally,  distributed  equally  among  all  sites  of 
one  of  two  groups,  one  of  sites  and  the  other 
of  mg  =  N  -  sites.  The  calculational  trick  of 
Andersen  and  Kenkre  [13,14]  consists  of  the 
generalization  of  (9)  to  define  new  quantities  p, 
q,  and  r,  which  allow  one  to  convert  the  trimer 
or  N-mer  problem  as  described  by 

-\v(^~c„  +  J,c^  +  ix\cSc„,  (13) 


into  a  completely  tractable  non-degenerate 
dimer  problem  [22]  involving  the  solution  of 

d^p  2  3 

=  -2%  -  (>yiP  -  ■  (14) 

Some  of  this  work  is  reappearing  in  the  literature 
and  is  being  applied  to  experimental  situations. 

4.  Finite  relaxation  and  thermal  fluctuations 

How  do  we  augment  the  DNLSE  (4)  to  treat 
situations  not  involving  time  scale  disparities 
between  the  quasiparticle  and  the  vibrations,  to 
eliminate  unrealistic  assumptions  such  as  that  of 
infinite  vibrational  relaxation,  and  to  include 
unavoidable,  and  important,  interactions  with 
reservoirs  which  give  rise  to  temperature  effects 
and  fluctuations?  This  is  the  question  that  we 
now  address.  Equation  (4)  is  the  result  of  a  time 
disparity  assumption  made  on  (2)  and  (3).  We 
now  retain  (2),  and  replace  (3)  by 

d^x„  tlc„  ,  ,  ,, 

+  +  ^  =  -const.  X  + 

(15) 

and  explore  the  consequences  of  dissipation 
introduced  by  the  rate  y,  and  of  fluctuation 
caused  by  the  random  force  term  R„{t).  For 
simplicity,  we  have  used  the  restriction  that  the 
reservoir  interaction  occurs  only  with  the  vi¬ 
brational  syst'^m.  Two  limits  of  (15)  are  par¬ 
ticularly  interesting:  zero  damping,  and  extreme¬ 
ly  large  damping.  Some  exact  solutions  can  be 
found  for  the  former  case  [23]  for  the  dimer.  The 
probability  difference  p{t)  shows  cn  or  dn  be¬ 
havior  as  in  the  adiabatic  solutions  [5]  and 
undergoes  a  characteristic  new  transition  into  a 
region  where  it  equals  the  sum  of  a  part  which  is 
proportional  to  the  appropriate  elliptic  function 
(cn  or  dn),  and  a  part  which  is  proportional  to  the 
cube  of  the  elliptic  function.  Here,  we  will  focus 
on  the  opposite  limit  of  extremely  large  damp¬ 
ing. 

If  the  damping  is  large  enough  to  justify  the 
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neglect  of  the  second  derivative  of  the  oscillator 
displacements,  more  formally,  if  tv— »ao, 
oi^ly-F,  the  evolution  of  the  oscillator  displace¬ 
ments  towards  their  equilibrium  positions  now 
possesses  a  single  characteristic  “vibrational  re¬ 
laxation”  rate  F.  Restricting  the  analysis  to  the 
two-site  system  in  the  interest  of  full  tractability, 
we  obtain  from  (2)  and  the  large  damping  limit 
of  (15), 

dp  dq  dr 

-^  =  2Vq,  -^=-2Vp-xyr, 

(16a) 


-^=-F{y-p)  +  F{t),  F(t)  =  IFikT/to^) S(t) , 

(16b) 

where  k  is  the  Boltzmann  constant. 

Equations  (16)  constitute  a  Langevin  set. 
Standard  techniques  allow  the  derivation  of  an 
exact  Fokker-Planck  equation  for  the  distribu¬ 
tion  function  in  p,  q,  r,  y  space  and  of  an 
approximate  but  useful  Fokker-Planck  equation 
for  the  distribution  function  o-(p,  q,  r;  i)  in  p,  q, 
r  space  alone.  The  latter  is  obtained  [24]  through 
the  application  of  projection  techniques  which 
eliminate  the  vibrational  variable  y,  and  is  valid 
in  the  high  damping  limit.  The  Fokker-Planck 
equation  can  be  solved  exactly  for  its  stationary 
state  distribution  function,  and  the  formalism 
can  be  used  in  two  separate  ways:  (i)  to  perform 
a  Kramer’s  first  passage  time  analysis  aimed  at 
an  investigation  of  the  stability  (against  thermal 
fluctuations)  of  the  nonlinear  structures  inherent 
in  the  DNLSE,  and  (ii)  to  carry  out  a  contrac¬ 
tion  analysis  [25]  from  the  Fokker-Planck  equa¬ 
tion  in  order  to  arrive  at  a  closed  equation  such 
as  (1)  for  the  quasiparticle  variables.  We  refer 
the  reader  elsewhere  [24]  for  (i),  and  concen¬ 
trate  on  the  results  of  (ii)  here.  The  contraction 
analysis  has,  as  its  consequence. 


dr 


=  2Vq, 


dt 


-2Vp-xpr  + 


2Vx 


qr-aq  , 


(17a) 

(17b) 


2Vx  2  ,  . 

-^  =  xpq--[^q  -«('•-^q)• 


(17c) 


Here,  is  the  thermal  equilibrium  value  of  r, 
and  a  is  a  rate  which  attempts  to  drive  the 
system  to  the  thermal  state.  A  high  temperature 
expression  [25]  for  this  rate  is  a  =  {2x/F)kT. 

The  set  of  equations  (17)  have  been  arrived  at 
by  extending  the  DNLSE  to  nonadiabatic,  finite 
temperature  situations  through  a  blend  of 
analytic  arguments  and  physical  assumptions. 
The  system  they  describe  reduces  to  the  trivial 
linear  dimer  if  ;^  =  0  =  a,  to  the  high-tempera- 
ture  darned  linear  dimer  if  x  vanishes  but  a  does 
not,  to  the  nonlinear  adiabatic  dimer  if  x  is  finite 
but  F  is  infinite  and  a  vanishes,  to  a  relatively 
crude  extension  [26]  of  the  nonlinear  dimer  to 
dissipative  situations  if  x  is  finite  but  F  is  infinite 
and  a  vanishes,  and  to  the  nonlinear 
nonadiabatic  dimer  if  x  and  F  are  finite  and  a 
vanishes.  The  latter  case  displays  a  rich  multi¬ 
tude  of  phenomena  including  a  fascinating  inter¬ 
play  of  quantum  phases  and  nonlinearity,  for 
which  we  refer  the  reader  to  refs.  [27].  Evolution 
showing  a  complete  combination  of  the  elliptic 
function  evolution  of  the  adiabatic  dimer  for 
short  times,  followed  by  a  self-trapping  swing 
into  the  localized  stationary  states  of  the  dimer 
for  longer  times,  which  is  itself  followed  by  a 
delocalization  and  symmetrical  spreading  over 
the  two  sites  characteristic  of  thermal  fluctua¬ 
tions,  has  been  exhibited  and  commented  on  in 
ref.  [25]  for  the  high-temperature  case  when 
which  can  generally  be  taken  to  equal  tanh(V7 
kT),  vanishes.  Exciting  new  behavior,  which 
occurs  when  does  not  vanish,  will  now  be 
mentioned  here  briefly. 

For  vanishing  a,  the  probability  difference  p 
oscillates  and  then  tends  to  the  stationary  value 
which  is  0  if  the  nonlinearity  parameter  is  small 
enough  and  finite  (corresponding  to  a  localized 
state)  if  it  is  large  enough.  As  a  increases,  the 
detrapping  effect  is  seen:  p  tends  to  0  at  larger 
times  even  for  large  nonlinearities.  As  a  in¬ 
creases  further,  a  surprising  burst  of  p  occurs  for 
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a  short  time,  and  the  burst  recurs  after  a  time 
period.  The  bursts  become  more  frequent  with 
further  increase  of  a  and  behavior  that  appears 
chaotic  occurs.  Phase  space  plots  in  the  p-q 
plane  show  that  a  limit  cycle  has  been  reached  at 
this  point,  as  p  (as  well  as  q)  oscillates  steadily 
between  two  finite  values  (see  (iv)  and  (v)  in  fig. 
1).  A  further  increase  in  a  destroys  the  limit 
cycle,  and  stable  dissipative  behavior  is  recov¬ 
ered  (case  (vi));  p,  q  tend  to  vanishing  values 
while  r  tends  to  Stability  analysis  carried  out 


by  Kenkre  and  Kus*  [26]  shows  that  the  destruc¬ 
tion  of  limit  cycles  occurs  for  a  >  i.e.,  for 

temp>eratures  exceeding  that  at  which  VIkT 
becomes  smaller  than  its  hyperbolic  cotangent  if 
X  and  r  are  assumed  to  be  independent  of 
temperature. 

This  bifurcation  behavior  is  reflected  in  excit¬ 
ing  predicted  features  of  several  observable 
quantities  [26].  An  example  is  the  degree  of 
fluorescence  polarization  in  poly-L-proline  oligo¬ 
mers  of  variable  length,  which  is  given  [6]  by  a 


Fig.  1.  p~q  phase  space  plots  from  (17),  the  dimer  form  of  (1),  showing  bifurcaton  behavior.  Each  frame  extends  from  —  1  to  1 
on  the  horizontal  p  and  the  vertical  q  axis,  and  the  initial  condition  is  of  one-site  occupation,  i.e.,  p  =  \,  q=Q.  The  nonlinearity 
xHV  is  larger  than  I:  localized  states  exist.  The  values  of  a  12V  ate  (i)  0.002,  (ii)  0.00968,  (iii)  0.04685,  (iv)  0.22676,  and  (vi) 
1.09752.  In  (i),  one  sees  clear  evolution  to  the  self-trapped  stationary  state  (p  <  0)  followed  by  symmetrization  (p  =  0)  as  a  result 
of  thermal  fluctuation.  Limit  cycles  are  destroyed  in  (vi). 
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linear  superposition  of  p  and  r,  the  coefficients  of 
the  superposition  being  subject  to  experimental 
control. 


5.  Concluding  remarks 

The  derivation  of  (1)  has  proceeded  through  a 
combination  of  exact  analysis  and  physical  argu¬ 
ments.  The  semiclassical  approximation  which 
yields  the  DNLSE,  the  reservoir  interaction,  also 
introduced  classically,  which  results  in  the  Fok- 
ker-Planck  equation,  the  high  damping  limit, 
the  contraction  analysis  from  the  Fokker-Planck 
equation,  and  the  generalization  from  the  dimer 
back  to  the  extended  system  necessary  to  pass 
from  (17)  to  (1),  are  the  primary  ingredients  of 
that  combination.  We  have  seen  the  rich 
behavior  that  emerges  from  (1).  It  encompasses 
simple  linear  evolution,  reflects  the  characteristic 
features  of  nonlinear  dynamics,  describes  the 
settling  of  the  system  into  stationary  states  as  a 
result  of  dissipation  and  finite  relaxation,  and 
predicts  fascinating  bifurcations.  What  is  re¬ 
quired  in  future  investigations,  in  the  opinion  of 
the  present  author,  is  not  only  a  better  under¬ 
standing  of  the  range  of  validity  and  applicability 
of  this  and  similar  transport  instruments,  but  also 
(and  especially)  work  on  the  design  of  specific 
experiments  on  the  basis  of  the  kind  of  theoret¬ 
ical  predictions  that  have  been  described  in  the 
present  article.  The  depolarization  of  fluores¬ 
cence,  the  scattering  of  probe  particles  such  as 
neutrons  by  light  interstitials,  the  quenching  of 
fluorescence  via  trapping,  and  the  evolution  of 
transient  gratings  are  particular  candidates  for 
the  experimental  probe.  It  is  hoped  that  the 
development  of  theoretical  analysis  and  the 
design  of  experiments  will  occur  in  close  collabo¬ 
ration  in  the  near  future. 
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We  report  some  recent  results  related  to  quantum  integrable  systems.  The  Baxter  formula  is  generalized  into  the  case  of 
finite  temperature.  We  reformulate  the  quantum  inverse  scattering  method  for  ID  quantum  systems  with  long-range 
interactions.  Extensions  of  the  Gaudin  model  and  integrability  of  the  Calogero  model  are  discussed. 


1.  Introduction 

In  1965,  the  soliton  as  a  new  concept  in 
nonlinear  dynamics  was  introduced  by  2^busky 
and  Kruskal  [1].  Exciting  findings  in  the  pioneer 
period  were  collected  in  an  excellent  review 
article  by  Scott,  Chu  and  McLaughlin  [2].  Dur¬ 
ing  the  last  decade,  there  continued  to  be  many 
important  developments  [3].  One  of  them  is  a 
unification  of  exactly  solvable  models  in  physics. 
The  quantum  inverse  scattering  method  places 
the  theory  of  exactly  solvab’  dels  in  a  unified 
framework  and  provides  tul  method  for 

studying  those  models.  Soi  models  in  (1  -l- 
1  )-dimensionai  quantum  theory  and  in  2-dimen- 
sional  classical  statistical  mechanics  share  a  com¬ 
mon  property;  to  each  model  we  can  associate  a 
family  of  commuting  transfer  matrices  which  are 
generators  of  an  infinite  number  of  conserved 
quantities.  This  property  may  correspond  to  the 
Liouville  theorem  for  classical  Hamiltonian 
systems.  A  sufficient  condition  for  the  corn- 
mutability  of  the  transfer  matrices  is  called  the 
Yang-Baxter  relation  [4].  The  Yang-Baxter  re¬ 
lation  is  a  key  of  new  ideas  and  new  concepts  in 
recent  mathematical  physics  such  as  knot  theory 
based  on  solvable  models,  and  quantum 
groups. 
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This  refiort  presents  some  recent  results  re¬ 
lated  to  the  quantum  integrable  systems.  It  is 
known  that  there  exists  a  correspondence  be¬ 
tween  ID  quantum  system  and  2D  classical 
system.  The  Baxter’s  formula  relates  a  2D  solv¬ 
able  lattice  model  to  a  ID  integrable  spin 
system.  In  section  2,  we  extend  the  Baxter’s 
formula  into  the  case  of  finite  temp>erature. 
Combining  this  extension  with  the  evaluation  of 
finite  size  corrections,  we  obtain  a  systematic 
method  to  calculate  low  temperature  expansions 
of  thermodynamic  quantities  [5-7].  Sections  3 
and  4  deal  with  integrable  systems  with  long- 
range  interactions.  Some  time  ago,  Gaudin  intro¬ 
duced  a  class  of  such  spin  models  using  the 
Bethe  ansatz  method.  In  section  3,  we  give  an 
algebraic  formulation  of  the  models.  Introducing 
inhomogeneities  into  lattice  models,  we  present 
a  general  method  to  construct  quantum  integra¬ 
ble  spin  Hamiltonians  with  long-range  pairwise 
interactions.  The  last  section  is  devoted  to  a 
reformulation  of  the  quantum  inverse  scattering 
method  for  ID  quantum  particle  systems.  In 
particular,  construction  of  conserved  operators 
for  the  Calogero-Moser  system  is  explicitly 
shown.  The  expression  in  terms  of  the  Lax 
operator  is  new.  Further,  we  find  that  the  Lax 
pair  yields  an  interesting  algebra. 
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2.  Finite  temperature  Baxter’s  formula 

We  consider  a  square  lattice  with  M  rows  and 
N  columns  under  the  periodic  boundary  condi¬ 
tion  (M  +  1  =  1,  N  +  1  =  1),  and  define  a  model 
of  classical  statistical  mechanics  on  the  lattice. 
We  assume  that  the  Boltzmann  weights  of  the 
lattice  model  satisfy  the  Yang-Baxter  relation. 
Let  Tf^{u)  denote  the  transfer  matrix,  where  u  is 
the  spectral  parameter.  The  Yang-Baxter  rela¬ 
tion  assumes  the  commutability  of  the  transfer 
matrices: 

[T»,7^(u)]  =  0.  (2.1) 

It  is  known  that  there  is  a  relationship  between 
the  row-to-row  transfer  matrix  Tf^(u)  and  a  ID 
quantum  Hamiltonian  H: 

=  .  (2.2) 

uU  u*0 

For  instance,  the  Heisenberg  XYZ  (XXZ)  model 
is  related  to  the  eight-  (six-)  vertex  model  by  a 
formula  (2.2).  We  call  (2.2)  Baxter’s  formula. 

The  relation  (2.1)  contains  useful  information. 
Expanding  the  transfer  matrix  in  powers  of  the 
spectral  parameter,  we  can  easily  verify  that  the 
transfer  matrix  is  a  generator  of  an  infinite 
number  of  conserved  quantities  (operators).  The 
first  term  in  such  an  expansion  is  the  shift 
operator: 

■  ■  ■  S(b^,  a^_,) ,  (2.3) 

where  a  =  {a^}  and  b  =  {bj}  denote  state  vari¬ 
ables  respectively  in  lower  and  upper  rows,  and 
S{bj,aj)  is  the  Kronecker  delta.  A  set  of  the 
conserved  quantities,  (4),  is  given  by 

d* 

4  =TTlog7’A,(«)  (2.4) 

OU  u=0 

Baxter’s  formula  (2.2)  corresponds  to  7,  up  to  a 
sign. 

The  above  discussion  indicates  that 
[Tj^{u),  li^]  =  0  and  that  T^{u)  and  {/^}  have 
common  eigenstates.  Thus,  Baxter’s  formula 


(2.2)  leads  to  the  relationship  between  the 
ground-state  energy  of  H  and  the  maximum 
eigenvalue  A„„(«)  of  T^(u): 

=  (2-5) 

We  have  observed  that  the  equivalence  through 
Baxter’s  formula  between  a  ID  integrable  quan¬ 
tum  system  and  a  2D  solvable  lattice  model  is 
useful  in  particular  when  we  discuss  the  zero- 
temperature  properties  (e.g.  ground  state 
energy)  of  the  former.  A  challenging  problem 
may  be  how  we  extend  Baxter’s  formula  to  study 
the  finite  temperature  properties. 

We  look  for  a  new  approach  to  analyze 
thermodynamic  properties  of  the  ID  quantum 
system.  Baxter’s  formula  (2.2)  assures  the  ex¬ 
pansion, 

7»  =  7^(0)  [1-UH  +  €iu^)] .  (2.6) 

Using  Trotter’s  formula,  we  have 

expi-pH)  =  lim  (7^(0)-'  T^{piM)f  ,  (2.7) 

where  p-Mk^T  as  usual.  Since  Ti^(0)  is  the 
shift  operator,  we  may  interpret  7;^(0)”*  T^^iu) 
as  a  diagonal-to-diagonal  transfer  matrix  and 
denote  it  by  7^^^(«).  Then,  the  free  energy  per 
site,  /,  of  the  ID  quantum  system  can  be 
expressed  in  terms  of  the  partition  function 
of  a  2D  classical  model  of  the  lattice: 

f""  i'*"  •‘’S  ^Mv(«) .  (2-8a) 

ZM  =  Tr(7“^'^(«))"  ,  u  -  PIM  .  (2.8b) 

The  eigenvalues  of  the  transfer  matrix  are  infi¬ 
nitely  degenerate  as  u—*0.  Therefore,  (2.8)  is 
not  practical  and  remains  to  be  formal.  To  avoid 
this  situation,  we  introduce  a  trick:  we  look  at 
the  lattice  from  the  crossing  channel,  that  is, 
from  a  90°  rotated  frame. 

Because  of  the  crossing  symmetry  of  the 
model,  this  manipulation  is  made  by  the  change 
in  the  spectral  parameter  u  into  A  -  u,  where  A  is 


164 


A#.  Wadati  et  <U.  /  Quantum  integrable  systems 


called  the  crossing  parameter  [4,5].  Corre¬ 
spondingly,  we  introduce  a  notation  T^{u)  = 
7'^^'“(A  -  «).  Now,  (2.8)  is  replaced  by 

/  =  -^  Wm^  log  .  (2.9a) 

Z^^{u)  =  Tr{Ti{u))\  (2.9b) 

where  Ai^{u)  is  the  largest  eigenvalue  of  the 
crossing  transfer  matrix  T^(u).  Equation  (2.9)  is 
a  finite  temperature  extension  of  Baxter’s  for¬ 
mula  (2.2),  and  has  been  named  finite  tempera¬ 
ture  Baxter’s  formula.  Remark  that  the  spectral 
parameter  u,  a  fundamental  quantity  in  soliton 
theory,  now  plays  a  role  of  intertwiner  between  a 
finite  size  system  and  a  finite  temperature 
system. 

A  caution  is  necessary  for  making  use  of 
formula  (2.9).  Since  the  temperature  depen¬ 
dence  enters  through  u  =  fi/M,  one  should  be 
careful  in  taking  the  limit  A/— Otherwise,  any 
information  of  finite  temperature  will  be  lost. 
Instead,  a  finite  size  calculation  or  a  1/Af-expan- 
sion  gives  a  systematic  method  to  obtain  low 
temperature  expansions  of  thermodynamic  quan¬ 
tities.  Figure  1  summarizes  the  significance  of  the 
limits  r-^0  and  for  the  ID  quantum 

system  and  the  2D  classical  system. 

The  theory  of  finite  size  corrections  has  at¬ 
tracted  much  attention  of  theoretical  physicists 
since  it  gives  central  charge  c  and  scaling  dimen¬ 
sion  X  which  are  essential  quantities  in  the 
conformal  field  theory.  To  evaluate  finite  size 
corrections,  we  adopt  a  method  by  de  Vega, 
Woynarovich  and  Eckle  [8].  The  calculation  is 


Fig.  I,  Relations  between  finite  size  system  and  finite  tem¬ 
perature  system. 


involved  but  straightforward.  We  apply  the 
Euler-Mclaurin  expansion  formula  and  the  solu¬ 
tion  technique  of  the  Wiener-Hopf  integral 
equation  to  the  Bethe  ansatz  equation. 

The  new  approach  to  finite  temperature  quan¬ 
tum  systems  presented  above  has  been  success¬ 
fully  applied  [5-7].  It  should  be  stressed  that  this 
approach  is  valid  for  any  quantum  chain  as  long 
as  it  is  solvable  by  the  Bethe  ansatz  method.  We 
only  mention  here  the  results  for  the  XXZ  spin 
chain  in  a  gapless  phase: 

N 

H  =  JtY  (o-;£r;^,  +4  cr;a,\,)  , 

/=i 

(2.10) 

4  =  cos  A.  (2.11) 

The  free  energy  /  with  the  first  thermal  correc¬ 
tion  is 


/=/x 


A(^b7’)^ 

12  sin  J-p  ’ 


(2.12) 


where  is  the  ground  state  energy.  The  result 
(2.12)  is  the  same  as  the  one  obtained  by  the 
Bethe  ansatz  method  with  the  string  hypothesis. 

Calculation  of  the  higher  terms  in  T  can  be 
done  by  making  use  of  the  perturbation  theory 
of  the  Wiener-Hopf  integral  equation.  In  the 
limit  A— >0  (the  XXX  model  limit),  the  correc¬ 
tion  terms  have  rather  simple  expressions.  The 
free  energy  /  and  the  correlation  length  ^  are 
respectively  given  by 

^  \2Jr  2/^  [105(7^/^3  7)]^ 

(2.13) 


^  r  riog(i/7) 

where  Oj  and  are  constants.  Two  remarks 
should  be  made.  First,  the  appearances  of  logar¬ 
ithmic  terms  in  (2.13)  and  (2.14)  are  notewor¬ 
thy.  This  is  the  first  analytical  derivation  of  such 
terms  and  has  been  confirmed  by  numerical 
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calculations  based  on  a  different  formalism. 
Second,  we  find  that  the  calculation  of  the 
correlation  length  is  quite  straightforward,  by 
making  use  of  the  result  for  the  corresponding 
2D  classical  models.  This  is  remarkable  since  in 
string  hypothesis  approach  we  can  obtain  only 
the  free  energy  and  cannot  get  any  information 
on  the  correlation  length. 


3.  The  Gaudin  model  and  generalizations 

Depending  on  the  interaction  ranges,  com¬ 
pletely  integrable  systems  may  be  classified  into 
two  groups.  One  is  a  system  with  short-range 
interactions  including  the  5-function  gas,  the 
Heisenberg  XYZ  model  and  the  Toda  lattice. 
The  other  is  a  system  with  long-range  interac¬ 
tions.  Contrary  to  the  short-range  interaction 
models,  there  has  not  been  a  systematic  work  on 
the  long-range  interaction  models.  So  far  we 
know  two  integrable  systems,  the  Calogero- 
Moser  model  and  the  Gaudin  model  [9].  Recent¬ 
ly  there  has  been  a  renewed  interest  in  the 
long-range  interactions  models.  It  is  believed 
that  the  models  have  the  Jastrow-type  wave 
functions  which  play  a  central  role  in  the  theory 
of  the  quantum  Hall  effects  and  the  high-T^ 
superconductors.  In  this  section,  we  present  a 
general  method  [10,11]  to  construct  quantum 
integrable  spin  Hamiltonians  with  long-range 
pairwise  interactions  which  extend  the  Gaudin 
model. 

Let  the  matrix  R  act  in  a  tensor  product  V 
of  the  linear  space  V.  The  Yang-Baxter  relation 
reads 

/?, 2 (m  -{;)/?, 3(m)  R^^{v) 

=  /?23(t;) /?,,(«) /?,2(u-u).  (3.1) 

Here  R^^  denotes  the  matrix  on  V  (8i  V  (g)  V,  acting 
as  R  on  the  ith  and  the  /th  spaces  and  as  identity 
on  the  other  space.  A  solution  of  (3.1)  is  referred 
to  as  R-matrix.  The  R-matrix  depends  on  the 
spectral  parameter  u  (or  v)  and  other  parame¬ 


ters,  one  of  which  we  denote  by  rf.  It  is  sometime 
convenient  to  use  the  ^-matrix  defined  by 

/J(u;t,)  =  R(«;t,)R,  (3.2) 

where  P  is  a  permutation  operator,  P{x^y)  = 
y<^x.  For  R(«;tj)  and  we  assume  the 

following  two  conditions. 

(I)  regular  condition, 

R(u  =  0;t,)  =  P,  R(u  =  0;t,)  =  /,  (3.3) 

(II)  quasi-classical  condition, 

R(u;t,  =  0)  =  /,  R(u;7,  =  0)  =  P.  (3.4) 

(3.3)  is  also  called  the  standard  initial  condition. 

In  terms  of  R-matrices,  we  define  the  transi¬ 
tion  matrix  t^{u)  and  the  transfer  matrix  T^{u) 
by 

=  Rof^{u-Xi^)---Roxiu-x^) ,  (3.5) 

TNiu)=T^{u-,  ^};t)) 

=  Trr».  (3.6) 

Remark  the  shift  Xj  of  the  spectral  parameter  in 
Rqj.  We  regard  R^*  as  a  matrix  acting  on  V^  0V4, 
where  V;,  is  the  “auxiliary”  space  and  V*  denotes 
the  ^th  site  on  a  one-dimensional  lattice.  In 
(3.6),  Tr  means  trace  in  the  space  Vq.  Then,  the 
system  (3.5)  is  identical  to  an  inhomogeneous 
ID  lattice  with  N  sites  (fig.  2). 

Because  of  the  additive  property  of  the  spec¬ 
tral  parameters,  the  Yang-Baxter  relation  re¬ 
mains  valid  for  the  inhomogeneous  lattice.  Then, 
the  transfer  matrices  (3.6)  with  different  spectral 
parameters  commute. 


u-x, 

U-X2 

U-Xn 

Fig.  2.  Inhomogeneous  lattice. 
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^  T^iu)  r»  -  T^iu)  =  0 .  (3.7) 

This  way  of  introducing  the  inhomogeneity  is 
known  as  the  Z-invariance. 

Let  I  A)  be  an  eigenstate  of  Tf^(u)  which  is 
characterized  by  a  set  of  “quasi-momenta”  A,, 
a  set  of  mutually  commuting 
operators  and  the  eigenvalue  problem  are  re¬ 
spectively  expressed  as 

Z*  =  =  X*;  ,  *  =  1,2 . N,  (3.8) 

Of  course,  explicit  forms  of  c(A)  and  A^  are 
model-dependent. 

We  differentiate  both  sides  of  (3.9)  with  re¬ 
spect  to  7j.  Then,  setting  tj  =  0  and  using  the 
quasi-classical  condition  (3.4),  we  get 


«*|A)l,=o  (^n  (3.1 


where 


(3.11) 


We  have  found  that  //*  in  (3.11)  can  be  ex¬ 
pressed  concisely  in  terms  of  the  /?-matrices.  We 
denote  by  the  transfer  matrix  from  lower 

states  {aj}  to  upper  states  {b^}.  From  the  defini¬ 
tions  (3.5),  (3.6)  and  (3.8)  and  the  regular 
condition  (3.3),  we  have 


^k{l}  ~  ~ 

~  Tr  ”  ^n)  '  '  ■  ^Ok  '  '  '  ^Oli^k  ~  -^i) 

=  - 1  - 1 )  •  •  •  - -'l ) 

^  ^k\i^k  ~  ^n)  '  '  '  ^kk  +  li^k  ~  +  ■ 

(3.12) 


^  d 

=  2jjiRk,(Xk-Xi;v)  . 

*  =  1,2,...,N.  (3.13) 

We  can  regard  {//*}  as  commuting  Hamilto¬ 
nians.  In  fact,  substituting  the  expression  Z^  = 
I  +  Tj//*  +  C(r}^)  into  (3.7),  we  see  that 

[//*,//, ]  =  0,  *,/=!, 2,. ..,N.  (3.14) 

This  indicates  that  the  Hamiltonian  system 

\ 

H  =  2  <^kffk  ’  a*:  constants  ,  (3.15) 

*=i 

is  completely  integrable. 

It  is  interesting  to  compare  formula  (3.13) 
with  Baxter’s  formula  (2.2).  While  Baxter’s  for¬ 
mula  gives  a  spin  Hamiltonian  with  nearest- 
neighbor  interactions,  the  formula  (3.13)  yields  a 
spin  Hamiltonian  with  long-range  interactions. 

The  above  discussion  is  general.  We  have  used 
only  the  Yang-Baxter  relation  and  the  properties 
of  the  f?-matrix  such  as  the  regular  condition,  the 
quasi-chemical  condition  and  the  inhomogeneity. 
Given  an  /?-matrix,  we  have  commuting 
Hamiltonians  {//*}  by  formula  (3.13).  For  in¬ 
stance,  from  the  /?-matrix  for  the  spin-|  XYZ 
model,  we  have 

~  '  ,?/  sn(x,  -  X,) 

X  {[1  -I-  A  sn^(x,  -  Xy)]o''crJ 
+  (1  -ksn\x,-x^)]fT^(r^ 
cn(x,  -  Xj)  dn(x,  -  Xj)  -  1)}  ,  (3.16) 

where  k  is  the  modulus  of  Jacobi’s  elliptic 
functions.  Applications  to  other  models  includ¬ 
ing  the  spin-1  model  have  been  discussed  [10,11]. 

4.  Quantum  integrable  particle  systems 


Differentiating  the  last  expression  and  using  the 
quasi-classical  condition  (3.4),  we  obtain 


The  quantum  inverse  scattering  method  for 
A^-particles  on  a  line  may  be  introduced  as 
follows.  Let  L  and  M  be  N  'xN  matrix 
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operators.  We  choose  L,  M  (Lax  pair)  such  that 
the  Lax  equation 

L  =  i[H,L]  =  i[L,M],  (4.1) 

is  equivalent  to  the  equation  of  motion  gener¬ 
ated  by  a  Hamiltonian  H  under  consideration. 
We  associate  an  evolution  of  “eigenstate”  as 

iO  ^[U,H]  =  MU  .  (4.2) 

Then,  from  (4.1)  and  (4.2),  we  have 

[H,U''LU]  =  0.  (4.3) 

Equation  (4.3)  is  a  quantum  version  of  the 
unitary  equivalence  by  P.  Lax.  For  classical 
systems,  the  condition  that  U~^LU  does  not 
depend  on  time  leads  to  the  existence  of  N 
conserved  quantities  {/„}, 

l„^~TT{U-'L'-U)  =  lTrL" ,  (4.4) 

where  Tr  means  the  trace  of  matrix.  For  quan¬ 
tum  systems,  since  L  and  U  are  operators,  the 
last  equality  in  (4.4)  is  not  guaranteed. 

To  be  specific,  we  restrict  our  discussion  to  the 
Calogero-Moser  system  whose  Hamiltonian  is 
given  by 


N 


^  =  7  S  P-  +  ig  S  yiXj  -  x^) , 

(4.5) 

(4.6) 

The  Lax  pair  is  found  to  be 

^ik  =  +  ifl(l  -  5y*)  a(Xy  -  X*) , 

(4.7) 

iWy*  =  a(l  -  5y*)  /3(Xy  “  X* )  -H  flSy*  ^  jiX J 

(4.8) 

where  a  is  a  constant  related  to  the  coupling 
constant  g  by  g  =  -  a,  and 

a(x)=l/x,  /3(x)  =  -l/x^. 

(4.9) 

Calogero,  Ragnisco  and  Marchioro  [12]  pre¬ 
sented  a  set  of  conserved  operators  (rigorously 
speaking,  their  potential  is  1/sinh^x  instead  of 


1/x^,  but  the  discussion  is  the  same).  They 
proved  that  with  g  =  a^, 

[W,  det  yl]  =  0  ,  (4.10) 

where  the  matrix  A  is  an  N  x  N  matrix  defined 
by  The  expansion  of  detyl  in 

powers  of  A  gives  a  set  of  conserved  operators 

R}: 

det(yl^  J  =  det[L,*  -  A5^  J 

N 

=  (-A)^+S(-A)^-R.  (4.11) 

rt  =  1 

lliose  conserved  operators  {7„}  have  the  same 
functional  forms  as  the  classical  ones.  Only  the 
difference  is  that  Pj  is  the  operator  in  the 
quantum  case.  To  carry  out  the  proof  of  (4.10), 
they  modified  (4.1)  into 

[H,  Ly*]  ~  2  S 

-M,L„-L„M,).  (4.12) 

However,  the  symmetrized  commutator  in  the 
r.h.s.  of  (4.12)  needs  a  justification  from  a 
viewpoint  of  the  quantum  inverse  scattering 
method.  We  therefore  believe  that  their  proof  is 
not  satisfactory,  although  the  conserved 
operators  {y„}  are  widely  accepted. 

An  alternative  method  to  find  conserved 
operators  may  be  the  following.  From  (4.1),  we 
have 

[//,  iL%]  =  S  [{L"),M,,  -  M.in,,] .  (4. 13) 

Explicit  form  of  A/y*  in  (4.8)  gives 

SMy*  =  0,  Sa/,*  =  0.  (4.14) 

/  * 

Then,  we  find  from  (4.13)  and  (4.14)  that 

[//,Z(L%1=0,  (4.15) 

^  j.k 

that  is,  conserved  operators  {/„}  are  given  by 
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n  =  l,2,...,N. 

j,k 


(4.16) 


Expression  (4.16)  is  new.  The  first  three  of  the 
conserved  operators  are 


^1  2  L,k  2  P,  , 


t.k 


4  -  I  2  (E‘)^*  -  3  S  p;  +  ig  S  .  -r  ' 
/  >»**  ^k) 


hk 


i.k 


Ik 


p;+igS  {piT^ 

I  l*k  ^  [Xi  - 


1  1 
V  -  V  Pk  V  -  r  + 


1 


x-x,-'^x,-x,  {x^-x,) 


IP.)  -  (4.17) 


As  we  expected,  g  appears  in  {/„}.  While  /,  and 
/j  are  the  total  momentum  operator  and  the 
Hamiltonian,  ^  is  a  non-trivial  conserved 
operator.  In  general,  /„  =  1/nE  p"  +  •  •  •  is  a 
polynomial  in  Py  and  \Hx^  ~^k)- 
We  have  found  that  the  Lax  pair  gives  an 
interesting  algebra.  Let  us  introduce  a  set  of 
operators. 


2  ^k/ 
k 

k 


-  p,  +  w  2 

r  J  Y  —  r 

k^j 

(4.18) 

-  p  +  in  2 

r  J  V  —  1* 

k^i  ^k 

(4.19) 

The  operators  h]  and  are  Hermitian  conju¬ 
gated  to  each  other.  They  satisfy  the  commuta¬ 
tion  relations 


[hi^hj  =  0,  [h],hl]  =  0, 

[h„hl]  =  2M,„,  /,m  =  l,2 . N.  (4.20) 

A  similarity  between  (3.14)  and  (4.20)  is  intrigu¬ 
ing.  In  terms  of  these  operators,  the  total 
momentum  operator  /,  and  the  Hamiltonian  I2 
are  simply  expressed  as 


/,  =  l2  =  i'Zh]h,. 


k  =  l 


k  =  \ 


1=1 


Further  detailed  analysis  of  long-range  integra¬ 
ble  Hamiltonian  systems,  in  particular,  the 
Calogero-Moser  system  with  spins  will  be  dis¬ 
cussed  elsewhere  [13]. 
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For  some  values  of  the  grid  resolution,  depending  on  the  nonlinearity,  the  discrete  nonlinear  Schrodinger  equation  with 
arbitrary  power  nonlinearity  can  be  approximated  by  the  corresponding  continuum  version  of  the  equation.  When  the 
discretization  becomes  too  coarse,  the  discrete  equation  exhibits  localization  in  regimes  where  blow-up  cannot  occur  in  the 
continuum  system.  This  phenomenon  is  investigated  numerically,  and  the  grid  resolution  at  which  the  transition  occurs  is 
determined. 


1.  Introduction 

The  one-dimensional  discrete  nonlinear  Schro¬ 
dinger  equation  with  arbitrary  power  nonlineari¬ 
ty,  O', 

w,  +  -  W,  +  (/,_,)  +  =  0  , 

(1) 

describes  a  system  of  /coupled  anharmonic  oscil¬ 
lators,  7  = -H/-- 1).  •  •  ■  .  K/- 1)  (/  odd). 
The  dispersive  coupling  has  the  strength  h~^, 
where  h  is  the  distance  between  adjacent  sites. 
Eq.  (1)  is  a  discretization  of  the  nonlinear  Schro¬ 
dinger  equation  with  arbitrary  power  non¬ 
linearity 

+ 'I'xx  +  ^  ■  (2) 

For  high  grid  resolution  (high  values  of  /)  the 
solution  to  eq.  (1)  is  expected  to  resemble  the 
solution  to  eq.  (2).  It  is  the  aim  of  this  paper  to 
study  to  what  extent  and  in  which  areas  of 
parameter  space  (/,  a)  this  is  true.  In  particular 
we  concentrate  on  the  phenomenon  of  localiza¬ 
tion  or  self-trapping  [1],  that  is,  the  dynamically 
stable  state  in  which  most  of  the  energy  of  the 
system  gets  concentrated  at  a  single  site.  The 


continuous  counterpart  of  localization  is  blow-up 
[2],  which  designates  the  situation  where  the 
maximum  of  |(/r|  tends  to  infinity  in  finite  time. 
Blow-up  has  been  studied  extensively  in  the  last 
decades,  and  has  now  been  observed  both  ex¬ 
perimentally,  analytically  and  numerically  in 
many  physical  systems.  Most  famous  is  presum¬ 
ably  the  Langmuir  wave-collapse  in  plasma 
physics  [3]  and  self-focussing  of  laser  beams  in 
nonlinear  optics  [4].  Furthermore  it  has  been 
used  to  describe  energy  transfer  in  molecular 
thin  films  [5]. 


2.  The  discrete  and  the  continuum  model 

The  discrete  model,  eq.  (1),  has  only  the  two 
conserved  quantities  the  number,  N,  and  the 
Hamiltonian,  H, 

N  =  h'Z  \U,W 

+/' 

//=-/!  E  + 

(3) 

where  /'  =  i{f  +  1).  Thus,  eq.  (1)  is  not  integra- 
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ble  when  the  system  has  more  than  two  degrees 
of  freedom.  As  a  measure  of  localization  (or 
self-trapping),  we  introduce  the  mean  width  of  a 
pulse,  W,  defined  as  where  the 

virial,  /,  is  given  by  [2] 

i' 

l  =  h^  2  y  corresponding  to 

/= 

+  3C 

7=1  .  (4) 

—  30 

Here  the  pulse  center  is  assumed  to  be  at  site 
j  =  0,  corresponding  to  Jt  =  0.  Thus  for  a  com¬ 
plete  localization  of  the  energy  at  the  center  site, 
W  becomes  zero.  The  virial  theory  [2]  is  a  classi¬ 
cal  method  to  determine  whether  a  given  initial 
waveform  will  collapse  in  a  finite  time  (7^0  for 
^0<“).  or  not. 

Corresponding  to  N  and  77  given  by  eq.  (3), 
the  continuum  eq.  (2)  has  the  conserved  quan¬ 
tities 

■f  oe 

f  kl'djc, 

—  30 
+  00 

//=  /  (l>/tf-(l  +  <rr'\,ltr^^)dx.  (5) 

—  30 

As  the  ground  state  solitary  solution  to  eq.  (2), 
with  zero  velocity  and  centered  at  jr  =  0,  we  find 

i/t^(x,  (;  ca)  =  w' "(1  +  sech' '^(tvo’x)  e““  '  , 

(6) 

where  w  is  a  real  parameter.  From  general 
studies  it  follows  that  the  solution,  given  by  eq. 
(6),  is  unstable  and  blows  up  at  the  center  if 
cr  >  2,  and  is  unconditionally  stable  for  tr  <2  [2]. 
The  different  regimes  of  a  are  denoted  the  sub- 
critical  (o-<2),  critical  (ff  =  2)  and  supercritical 
regime  (o->2),  respectively.  In  the  critical  case 
an  arbitrary  initial  condition  will  blow-up  if  the 
requirement  A/  >  N,  is  fulfilled.  Here 
where  N{iJ/)  is  given  by  eq.  (5). 


3.  Numerical  results 

Eq.  (1)  is  integrated  numerically  with 
tl/^(x,  r;  o)  =  1),  given  by  eq.  (6),  as  initial  con¬ 
dition 

t/^(0)  =./»,(  ;7i,0;a)  =  l).  (7) 

A  fourth  order  Runge-Kutta  scheme,  with  time 
step  d^=10~^  is  used  for  the  integration 
throughout  the  paper.  The  value  10”*  of  dr 
assures  the  conservation  of  N  and  77  within  10“^ 
of  their  initial  values  for  all  runs.  The  most 
important  results  found,  have  been  checked  by 
repeating  them  with  dr  =  5  x  10”^  and  dr  =  5  x 
10” ^  and  by  applying  another  integration 
scheme,  the  Burlirsch-Stoer  method  with 
Richardson  extrapolation  [6],  to  assure  that  they 
are  independent  of  the  time  integration.  N,  77,  7 
etc.  are  checked  at  time  intervals  Ar=  10”^  and 
the  length  of  the  jc-interval,  L  =  {f-  \)h,  has 
the  value  25.6  for  results  presented. 

Fig.  1  shows  the  time  evolution  of  the  am¬ 
plitude  Ay  =  \Uj\,  for  different  values  of  cr  and /. 
For  (7  =  1.90  and  /=185  (fig.  la)  the  pulse  is 
stable.  This  is  also  predicted  from  the  continuum 


Fig.  1.  Amplitude  of  the  solution  to  eg  (I)  and  (7)  with 
Z.  =  25.6.  (a)  (7=  1.90,  /=  185,  (b)  cr  =  1.95.  /=  1.85,  (c) 
(7=  1.95,7=191,  (d)  (7  =  2.00,  7  =  501. 
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system,  eq.  (2),  since  a  =  1.9  is  a  subcritical 
value.  However,  a  small  oscillation  of  the  am¬ 
plitude  and  width  of  the  pulse  is  noticed.  Fig.  lb 
shows  that  most  of  the  energy  becomes  localized 
at  the  center  site  when  tr  is  increased  to  1.95. 
This  contrasts  the  stability  of  the  ground  state 
solution  for  the  continuum  model,  in  the  sub- 
critical  regime.  The  observed  localization  is  not 
complete  (Wt^O)  since  not  all  of  the  energy  is 
concentrated  at  the  center  site.  Actually,  for  the 
initial  conditions  used  in  our  numerical  simula¬ 
tions,  the  state  of  complete  localization  cannot 
be  reached.  There  will  always  be  a  certain 
amount  of  energy  at  the  neighbouring  sites. 
However,  a  final  state  as  in  fig.  lb  will  be 
characterized  as  localized  in  the  following.  In  fig. 
Ic,  /  is  raised  to  191  while  a  is  still  1.95.  It  is  seen 
that  this  decrease  of  h  prevents  localization  of 
the  energy.  Instead  periodic  oscillations  occur. 
These  are  of  the  same  type  as  in  fig.  ..  We  have 
checked  that  they  continue,  at  least  until  /  =  100 
(further  increase  of  /  leads  to  a  decrease  of  the 
oscillations).  In  fig.  Id,  cr  is  raised  to  the  critical 
value  2,  where  blow-up  occurs  in  the  continuum 
system.  Correspondingly  localization  is  ob¬ 
served,  despite  the  fine  resolution  /=501.  We 
note  that  the  localization  for  o-  =  1.95  (fig.  lb)  is 
due  to  the  discreteness  of  the  system,  when  the 
oscillation  pulse  is  most  narrow  it  cannot  be 
resolved.  In  contrast,  the  localization  for  a  =  2 
(fig.  Id)  is  a  reflection  of  the  blow-up  phenom¬ 
enon  in  the  continuum  system.  The  discreteness 
and  ihe  conservation  of  N  prevents  the  am¬ 
plitude  at  a  single  site  to  reach  infinity.  While  the 
localization  for  tr  =  1 .95  can  be  removed  by  in¬ 
creasing  /,  this  is  not  the  case  with  the  localiza¬ 
tion  for  cr  =  2.  Here  the  localization  is  only 
delayed  when  /is  increased.  The  variation  of  the 
oscillations  of  pulse  width  and  amplitude  for 
different  values  of  tr  are  shown  in  more  detail  in 
fig.  2  in  the  subcritical  regime,  a'<2.  Here  the 
time  evolution  of  the  mean  width,  W,  and  the 
center  amplitude,  A„,  are  depicted.  Clearly  both 
the  period  and  the  amplitude  of  the  oscillations 
are  increasing  with  cr.  Furthermore,  the  oscilla- 


Fig.  2.  Width,  W,  and  center  amplitude,  A^,  as  function  of 
time  for /=  257  and  L  =  25.6.  (7  =  1.5,  1.6,  1.7,  1.8,  1.9. 


tions  of  W  and  A„  are  90°  out  of  phase,  due  to 
conservation  of  N.  Fig.  3  shows  the  variation  of 
the  oscillations  for  different  values  of  /,  the 
number  of  sites  in  the  interval  L  =  25.6.  The 
nonlinearity,  tr,  has  the  subcritical  value  1.95. 
For  /  ^  1 .87  the  time  evolution  has  two  dis¬ 
tinguishable  states:  (i)  First  there  is  a  monotoni- 
cal  decrease  (increase)  of  W  (A^),  resembling 
the  initial  phase  of  a  blow-up  in  the  continuum 
system.  The  first  minimum  (maximum)  of 
W  (A„)  is  delayed  as  /  is  increased.  We  note 
again  that  W  never  reaches  zero,  but  has  a 
minimum  value  approximately  equal  to  the  dis¬ 
tance  between  sites,  h.  (ii)  Then  follows  sus¬ 
tained  rapid  oscillations.  The  mean  value  of  this 
final  oscillation  increases  with  /,  while  the  period 
decreases  with  /.  For  these  /-values  the  time 
evolution  is  of  the  same  type  as  shown  in  fig.  lb. 

At  /=  191  an  abrupt  transition  has  occurred. 
There  is  no  longer  localization.  Instead  the  pulse 
performs  a  slow  periodic  oscillation,  of  the  same 
type  as  seen  in  fig  la  and  fig.  Ic.  We  shall 
denote  this  transition  value  of  /,  by  /.  Increasing 
/  to  501,  the  oscillation  are  still  present,  but 
much  weaker. 


Fig  3.  as  function  of  time  for  <t  ~  i.95  and  L  =  25.6. 
/=  171,  187.  19!.  201. 
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Fig.  4.  W"  and  (defined  in  text)  as  function  of  /  with 

L  =  25.6.  (-  •  -)  tr  =  1.95  and  ( - )  a  =  1.99.  Initial  values 

oi  W"  and  are  shown  as  dashed  lines. 


Fig.  5.  Amplitude  of  the  solution  to  eq.  (1)  with  /=  107, 
cr  =  2  and  L  =  25.6.  The  amplitude  of  the  initial  condition, 
given  by  eq.  (7),  is  reduced  by  a  factor  (a)  1.000,  (b)  0.994, 
(c)  0.993. 


These  results  are  summarized  in  fig.  4.  Here  r™ 
is  defined  as  the  value  of  t  at  which  W  reaches  its 
first  minimum.  Correspondingly,  W"  =  W(f'") 
and  i4o  =  y4o(t'").  Results  for  two  subcritical 

values  of  cr,  cr  =  1.95  ( - )  and  cr  =  1.99  ( - ), 

are  shown. 

For  f  increases  with  /.  Here  f"’  is  the 

time  of  localization.  For  decreases 

rapidly  towards  an  asympF<tic  value.  Here  2f"*  is 
the  period  of  the  slow  oscillations. 

For  /  </,,  W”  decreases  approximately  as  1//, 
again  implying  that  the  width  is  bounded  from 
below  by  the  distance  between  sites,  h.  For 
increases  asymptotically  towards 
W(0),  indicated  by  dashed  line.  A  corresponding 
behavior  is  observed  for  A™.  (The  randomness 
for  /  <  /,  is  due  to  limited  time  resolution  in  the 
plotting  Af  =  10”^). 

In  fig.  5  cr  =  2  (the  critical  value)  and  /=  107. 
Fig.  5a  shows  the  time  evolution  of  the  am¬ 
plitude,  with  the  initial  condition  given  by  eq. 
(7).  Localization  is  seen  to  occur.  In  fig.  5b  the 
initial  amplitude  is  reduced  by  a  factor  0.994. 
Localization  still  occurs,  but  delayed  with  respect 
to  the  localization  in  fig.  5a.  In  fig.  5c  the  initial 


amplitude  is  reduced  by  a  factor  0.993.  Here  the 
pulse  no  longer  localizes,  but  disperses  slowly. 

In  the  continuum  model  the  criterion  for  blow¬ 
up  of  an  arbitrary  initial  condition  is  = 

jttVS  in  the  critical  case.  Fig.  5  shows  that  in 
the  discrete  system,  for  /=  107,  the  limiting 
value  of  N  is  reduced  from  to  0.988 -A/,. 
Decreasing  /  further,  this  limiting  value  will  de¬ 
crease  correspondingly. 


In  this  paper  we  have  studied  numerically  the 
relation  between  localization  in  the  discrete  non¬ 
linear  Schrodinger  equation  with  arbitrary  power 
nonlinearity,  and  blow-up  in  the  corresponding 
continuum  equation.  It  is  found  that  localization 
may  occur,  even  for  subcritical  values  of  the 
degree  of  nonlinearity.  A  sharp  transition  be¬ 
tween  localization  and  smooth  oscillatory  be¬ 
havior  occurs.  A  reduction  of  the  initial  am¬ 
plitude  is  shown  to  be  equivalent  to  a  reduction 
of  the  nonlinearity. 


4.  Conclusion 
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Stationary  state  energy  levels  of  the  quantum  discrete  self-trapping  equation  (DST)  dimer  are  computed  numerically 
and  estimated  analytically.  Quantum-mechanical  manifestations  of  a  self-trapping  transition  that  occurs  in  the  corre¬ 
sponding  classical  DST  dimer  are  found  in  both  quantum  energy  levels  and  quantum  wavefunctions.  The  work  is  motivated 
by  applications  of  the  DST  equation  as  a  model  for  local  vibrational  modes  in  polyatomic  molecules. 


1.  Introduction 

Clarifying  the  influence  of  quantum-mechani¬ 
cal  effects  on  the  behavior  of  classically  non¬ 
linear  dynamical  systems  is  an  important  goal  for 
future  nonlinear  science  research.  A  mathemati¬ 
cal  model  particularly  well-suited  to  this  research 
area  is  the  discrete  self-trapping  equation  (DST), 
introduced  in  1985  by  Eilbeck,  Lomdahl  and 
Scott  [1],  The  DST  is  a  nonlinear  and  generally 
non-integrable  classical  dynamical  system  of  / 
coupled,  anharmonic  oscillators  and  is  uncom¬ 
monly  easy  to  quantize.  Already  the  DST  has 
been  useful  in  studies  of  quantum  manifestations 
of  nonlinear  classical  phenomena  such  as  soliton- 
like  energy  localization  [2,3]  and  deterministic 
chaos  [4,5].  The  DST  is  also  a  useful  model  for 
molecular  bond  vibrations  [6-8]. 

In  the  special  case  of  just  two  degrees  of 
freedom,  that  DST  equation  takes  the  form 

7  1  |2 

=  a,  , 

•  I  12 

-ykl  «2 .  (1) 

where  is  the  complex  amplitude  of  vibrations 
of  the  yth  bond,  (o„  is  the  frequency  of  uncoup¬ 
led,  low  amplitude  oscillations  and  the  real 
parameters  J  and  y  give  the  magnitudes  of  linear 


intersite  coupling  and  on-site  anharmonicity, 
respectively.  The  effective  nonlinearity  is 


where  N  =  la,|^  -b  Both  the  norm  N  and  the 
Hamiltonian 


irda.r  +  -  J{a*a,  +  a*a,) 

{*  denotes  the  complex  conjugate)  are  conserved 
by  eq.  (1);  for  two  freedoms  the  DST  is  an 
integrable  system. 

An  important  class  of  solutions  of  eq.  (1)  are 
the  stationary  solutions,  for  which 

a,.  =  a,.(0)e““',  (3) 

with  the  same  constant  frequency  o)  for  all  j  [1]. 
For  0  ^  f  <  2,  there  are  two  stationary  solutions: 
one  is  symmetric  (a,  =02)  and  the  other  is  anti¬ 
symmetric  (a,  =  -02);  each  distributes  energy 
equally  between  the  two  sites.  However,  at  ^  =  2 
there  is  a  bifurcation.  For  ^  >2  the  symmetric 
stationary  solution  is  unstable  and  two  new 
stable  solutions  are  found.  As  (  increases,  these 
two  other  stationary  solutions  become  the  lowest 
in  total  energy  and  each  concentrates  more  and 
more  energy  on  one  of  the  two  sites.  For  these 
two  local  mode  stationary  solutions  [1], 
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f  ? 

Fig.  1.  Quantum  and  classical  transitions,  (a)  The  energies  of  stationary  solutions  of  eq.  (1)  vs.  (  for  /V  =  20,  7  =  1,  and  <v„  =  0. 
The  dashed  curve  indicates  that  the  symmetric  stationary  solution  is  unstable  above  (  =  2.  (b)  Quantum  energy  levels  vs.  f  for  the 
quantum  level  n  -20,  7  =  1  and  tag  =  0. 


(4) 

Although  the  two  oscillators  are  identical,  eq. 
(4)  shows  that  above  the  { =  2  self-trapping 
transition,  vibrational  amplitude  -  and  hence 
energy  -  can  be  localized.  Figure  la  shows  how 
the  total  energies  of  all  the  stationary  solutions 
of  eq.  (1)  vary  with  f. 

This  paper  describes  recent  efforts  to  under¬ 
stand  how  the  classical  self-trapping  transition  at 
^  =  2  is  reflected  in  the  energy  levels  and  station¬ 
ary  state  wavefunctions  of  the  corresponding 
quantum-mechanical  problem  (QDST). 


2.  Quantum  energy  levels 

Because  solutions  of  the  classical  DST  equa¬ 
tion  conserve  the  norm  N,  solutions  of  the 
corresponding  quantum-mechanical  problem 
conserve  the  total  number  n  of  vibrational 
quanta  (bosons).  This  makes  it  easy  to  quantize 
eq.  (1)  [2].  The  classical  Hamiltonian  H  becomes 
the  quantum  Hamiltonian  operator 

-^,y[{B\B,f  +  {B\B,f]-J{B\B,  +  B\B,), 


A  ^  A 

where  5,  (B,)  is  a  quantum  raising  (lowering) 
operator  for  bosons  on  site  i. 

For  each  non-negative  integer  boson  number  n 
there  are  n  -i- 1  stationary  quantum  states  which 
we  label  m  =  0, .  .  .  ,/r.  These  states  are 
represented  in  the  {n  -f  l)-dimensional  basis  of 
product  states  |ni)|rt2)>  where  |n,)  is  the  eigen¬ 
function  of  a  harmonic  oscillator  at  site  /  with 
quantum  number  n,,  and  n,  ■¥n2  =  n.  We  write 

=  -/>!/>.  (5) 

/  =  0 

Requiring  that  solves  Schrodinger’s  linear 

wave  equation  for  a  particular 

value  of  n,  we  get  an  (n  +  1)  x  (n  -f  1)  matrix 
eigenvalue  problem  for  the  stationary  state 
energies  E*'”’  and  the  corresponding  coefficient 
vectors  cT’,  c["'\  ..., 

m  =  0, . .  .  ,  n. 

Computed  energy  levels  for  n  =  20  are  plotted 
vs.  i  in  fig.  lb.  Near  the  transition  point  ^  =  2, 
the  curves  for  the  lowest  levels  change  slope  and 
combine  into  nearly-degenerate  pairs.  The  classi¬ 
cal  bifurcation  at  f  =  2  is  clearly  manifested  in 
changes  in  the  corresponding  quantum  energy 
spectrum. 

Analytical  estimates  for  the  n  +  1  energy  levels 
and  eigenfunctions  at  the  nth  quantum  level  in 
the  large-f  limit  can  be  obtained  by  writing  the 
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quantum  Hamiitonian  operator  A  =  Hq  +  Jfl^ . 
Degenerate  regular  perturbation  theory  gives 

=  ("o  -ir)"  -  jylin  -  mf  +  m^] 

^  T  \  n-2m  +  l  ~  n-2m-l  ) 

+  €{J^)  (6) 

for  the  energy  levels,  m  =  0, . .  .  ,n.  The  corre¬ 
sponding  zero-order  wavefunctions  are  also 
found  up  to  G{J). 

Perturbation  calculations  for  small  C  can  also 
be  carried  out,  by  representing  the  problem  in  a 
new  basis  that  diagonalizes  fl  in  the  limit  ^  =  0. 
Writing  H  =  Hq  +  yf^^  and  using  non-degenerate 
regular  perturbation  theory,  we  obtain  for  m  = 
0, .  . .  ,  n  energies 


=  (OQti  +  J{2m  -  n)~  j-yn 


_  1 


y^(2m  -  n) 


^y{n  +2nm- 2m  )  +  ■  ^2/ 

x[2m(n  -  m)  +  n  -  1]  + €(y^)  (7) 


along  with  the  «  -i- 1  corresponding  wavefunc¬ 
tions  up  C(7). 

In  using  coupled  anharmonic  oscillator  models 
such  as  the  QDST  to  describe  local  modes  in 
molecules,  the  lowest  energy  levels  for  a  given 
value  of  n  are  especially  significant;  these  levels 
correspond  to  the  predominant  peaks  in  ex¬ 
perimental  absorption  spectra  [9].  In  our  nota¬ 
tion  for  both  large-  and  small-^  perturbation 
theories,  the  lowest  level  for  a  given  n  is  In 
fig.  2  the  small  y  perturbation  result,  the  small  J 
perturbation  result,  and  the  exact  (numerical) 
value  of  are  plotted  vs.  ^  for  n  =  5.  For 
C  <2,  the  small-y  perturbation  result  and  the 
exact  result  are  so  close  that  they  appear  as  a 
single  line;  the  same  is  true  for  the  small-/ 
perturbation  result  and  the  exact  for  {  >2. 
Patching  together  the  two  second-order  per¬ 
turbation  theory  results  appears  to  give  a  good 
approximation  for  the  level  for  all 


Fig.  2.  £'°’  vs.  C:  exact  value  ( . ),  small  y  perturbation 

theory  ( — )  and  small  J  perturbation  theory  ( . ).  The 

parameters  cuo  =  3000  and  J  =  100  and  the  quantum  level 
n  =  5  used  here  are  realistic  for  modeling  carbon-hydrogen 
stretch  vibrations  in  molecules. 


3.  Energy  level  splittings 

For  and  n>2,  the  two  lowest-energy 
quantum  eigenstates  for  a  given  n  are  nearly 
degenerate  and  can  combine  to  form  a  long- 
lived,  energy-localizing  wavepacket  [3].  Here  we 
consider  more  general  values  of  In  fig.  3,  the 
energy  splitting  A£  between  the  lowest  two 
energy  levels  of  the  QDST  dimer  is  given  as  a 
function  of  f  for  several  n  values.  These  numeri¬ 
cal  results  suggest  that  in  the  classical  limit 


Fig.  3.  Computed  splitting  A£  between  the  two  lowest 
QDST  energy  levels  for  a  given  n  vs.  (  for  (from  right  to 
left)  n  =  6.  12,  25,  50,  100,  200  and  400,  7  =  1  and  =  0. 
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n  ^  00,  the  two  lowest  levels  become  degenerate 
precisely  at  the  classical  bifurcation  point  ^  =  2. 

In  the  large  C  limit,  a  useful  formula  for  the 
energy  splitting  AC  is  [3] 


Numerical  tests  [3]  have  shown  that  for  fixed 
J  ^y,  the  exact  AE  converges  to  the  expression 
in  eq.  (8)  for  large  n.  It  has  been  conjectured 
that  eq.  (8)  is  good  for  f  >  2. 

Using  Stirling’s  approximation  for  n\  in  eq. 
(8),  one  can  show  that  for  large  n,  the  exact  Af 
diverges  from  the  right  side  of  eq.  (8)  for  ^  e  » 
2.718.  Thus  eq.  (8)  should  apply  to  the  ODST 
model  for  quantum  levels  n  >  eJ/y.  This  has 
been  verified  numerically.  Thus,  the  approxi¬ 
mation  in  eq.  (8)  breaks  down  just  above  the 
classical  bifurcation  point. 


4.  Wavefunctions 

Further  insight  into  the  quantum-mechanical 
dynamics  of  the  QDST  dimer  can  be  obtained  by 
considering  the  quantum  wavefunctions.  How¬ 
ever,  the  problem  of  finding  a  quantum-mechani¬ 
cal  analog  of  classical  local  modes  is  complicated 
by  a  quantum-mechanical  symmetry  condition; 
the  nondegenerate  stationary  quantum  states  for 
systems  of  two  identical  oscillators  must  be 
symmetric  or  anti-symmetric  with  respect  to 
exchanges  of  the  two  sites.  Here  we  handle  this 
complication  in  two  ways:  (i)  by  comparing 
classical  local  mode  stationary  solutions  to  non¬ 
stationary  quantum  wavepackets  and  (ii)  by 
examining  a  certain  property  of  the  properly- 
symmetrized  stationary  quantum  states  that  is 
related  to  classical  energy  localization. 

4.1.  Local  mode  wavepackets 

Nonstationary  energy-localizing  quantum 
wavepackets  can  be  created  from  superpositions 


of  the  stationary  states.  For  example,  the  lowest- 
and  second-lowest-energy  stationary  states  for  a 
given  value  of  n  are  symmetric  and  anti-symmet¬ 
ric,  respectively.  A  non-stationary  wavepacket 
created  by  adding  these  two  stationary  states  can 
initially  concentrate  energy  on  one  of  the  two 
sites  [3].  The  degree  of  localization  in  such  a 
non-stationary  wavepacket  can  be  characterized 
by  the  value  of 

{b;b,-b;b,)u,  (9) 

where  f  =  0  is  the  initial  time.  This  is  the  quan¬ 
tum  analog  of  the  classical  quantity  Iflil'  ~  |a2p- 
Numerical  calculations  for  the  quantum  problem 
suggest  that  one  can  create  for  quantum 
wavepackets  for  which  the  value  of 
^2^2)lr=o  approaches  the  right  side  of  eq.  (4)  as 
n—*x.  In  this  way  the  initial  degree  of  localiza¬ 
tion  in  such  quantum  wavepackets  approaches 
the  degree  of  localization  in  the  corresponding 
classical  local  mode  stationary  solutions. 

Creating  a  wavepacket  that  initially  concen¬ 
trates  energy  on  site  1  requires  that  symmetric 
and  anti-symmetric  stationary  states  be  added 
with  a  specific  relative  phase  at  t  =  0.  Because 
the  two  stationary  states  have  slightly  different 
energies  and  hence  evolve  at  different  charac¬ 
teristic  frequencies,  the  phase  relationship  estab¬ 
lished  at  t  =  0  gradually  changes  and  energy  is  no 
longer  concentrated  on  the  original  site.  For  ^ 
above  the  self-trapping  transition,  the  time  At  for 
which  the  energy  remains  concentrated  on  the 
initial  site  increases  with  increasing  n,  because 
the  energy  splitting  AE  between  the  two  station¬ 
ary  state  components  decreases  (see  fig.  3).  For 
large  n  and  ^  ^  2,  the  initial  phase  relation  takes 
a  long  time  to  decay.  Such  quantum  wavepackets 
are  the  analogs  of  the  classical  local  modes. 

4.2.  Stationary  state  wavefunctions 

There  are  a  number  of  reasons  to  consider 
properties  of  QDST  wavefunctions  other  than 
the  characteristics  of  the  local  mode  wavepackets 
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discussed  above.  First,  it  is  desirable  to  under¬ 
stand  how  all  types  of  classical  solutions  (not  just 
local  mode  stationary  solutions)  of  eq.  (1)  are 
connected  to  exact  solutions  of  the  corre¬ 
sponding  quantum  problem.  Also,  it  is  important 
from  the  point  of  view  of  applications  to  consider 
the  case  where  a  system  remains  in  a  superposi¬ 
tion  of  stationary  states  for  a  relatively  long  time 
but  external  forces  lead  to  a  rapid  randomization 
of  the  phase  relations  between  the  wavepacket 
components  [10];  in  such  cases  the  properties  of 
the  stationary  states  are  more  significant  than  are 
wavepacket  properties  (such  as  (9))  that  depend 
on  the  initial  phase  relations.  This  motivates  an 
examination  of  how  the  individual  quantum 
stationary  states  change  as  the  anharmonicity 
parameter  ^  is  tuned  through  the  self-trapping 
transition. 

It  is  useful  to  characterize  stationary  quantum 
states  1^'""  using  the  quantum  analog  of  the 
classical  time-average  of  l|a,|^  -  The  ana¬ 
log  for  stationary  quantum  states  is  the  diagnos¬ 
tic 

(10) 

"  >=0 

a  measure  of  the  average  difference  between  the 
excitation  levels  of  the  two  sites  in  the  dimer  that 
does  not  distinguish  which  site  is  more  highly 
excited.  The  values  of  D*'"’  are  easily  calculated. 
The  number  cj'”’  in  eq.  (10)  is  the  coefficient  of 
in  the  basis-state  expansion  of  the 
stationary  wavefunction  </>*'"’.  The  factor  \n  -2j\ 
gives  the  magnitude  of  the  difference  between 
the  quantum  level  at  site  1  and  the  quantum 
level  at  site  2  in  that  basis  state.  A  value  of 
near  1  indicates  that  when  the  system  is  in  the 
stationary  state  there  is  a  high  probability 
that  one  site  is  more  excited  than  the  other. 

In  fig.  4,  D'™’  is  plotted  vs.  f  for  each  of  the 
21  stationary  quantum  states  for  the  case 
n  =  20.  Note  that  D*'"’  can  be  nonzero  even 
when  ^  =  0;  the  stationary  quantum  states  with 
the  smallest  values  of  D'""  for  ^  =  0  correspond 


0  2  4  6  8 


? 

Fig.  4.  Solid  lines:  the  localization  measure  0“"’  vs.  f  for  the 
21  stationary  quantum  states  at  the  quantum  level  n  =  20, 
7=1,  (K„  =  0.  Dashed  line:  ||a,|-|a2ll  for  classical  local 
mode  stationary  solutions. 

to  the  classical  symmetric  and  anti-symmetric 
normal  modes.  The  stationary  quantum  states 
with  the  largest  values  of  Z)*"*’  when  ^  =  0 
correspond  to  non-stationary  classical  solutions 
in  which  100%  of  the  energy  oscillates  between 
the  two  sites. 

As  C  increases  above  2,  pairs  of  stationary 
states  develop  large  values  of  D‘'"\  These  pairs 
of  stationary  state  wavefunctions  are  those  with 
which  one  can  build  an  energy-localizing  quan¬ 
tum  wavepacket;  they  correspond  to  the  pairs  of 
nearly-degenerate  energy  levels  at  the  bottom  of 
fig.  lb.  For  large  values  of  n,  the  uppermost 
solid  curve  in  fig.  4  for  f  ^  2  is  well-approxi¬ 
mated  by  the  absolute  value  of  the  right  side  of 
eq.  (4)  with  N  =  n.  The  classical  expression  is 
plotted  with  a  dashed  line  in  fig.  4  for  com¬ 
parison. 

5.  Discussion 

Solutions  of  the  fully  quantum-mechanical 
version  of  the  DST  dimer  reflect  the  classical 
self-trapping  transition  in  a  number  of  different 
ways.  Numerically-calculated  quantum  energy 
levels  manifest  the  classical  self-trapping  bifurca¬ 
tion  most  notably  in  the  quasi-degeneracy  that 
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occurs  above  the  transition  point  between  the 
two  lowest  levels  for  a  given  boson  number  n. 
Carefully  constructed  quantum  wavepackets  and 
even  some  properly-symmetrized  stationary 
quantum  states  (if  characterized  by  the  quantity 
D*"')  echo  the  behavior  of  the  classical  local 
modes  for  { ^2.  The  results  presented  here 
illustrate  connections  between  quantum  and  clas¬ 
sical  behavior  that  are  likely  to  exist  for  other 
anharmonic  oscillator  models  as  well. 

A  main  motivation  for  studying  the  quantum 
mechanics  of  coupled  anharmonic  oscillator 
systems  is  to  obtain  a  better  understanding  of  the 
bond  vibrations  in  polyatomic  molecules.  Fur¬ 
ther  knowledge  about  modeling  bond  vibrations 
is  needed  for  theoretical  chemistry  problems 
such  as  interpreting  molecular  spectra  [11],  un¬ 
derstanding  chemical  reactions  involving  vib- 
rationally-excited  molecules  [12]  and  developing 
techniques  for  bond-specific  laser  photochemis¬ 
try  [13,14]. 

The  DST  equation  has  a  useful  role  in  the 
progress  of  such  research  because,  unlike  some 
other  commonly-studied  anharmonic  oscillator 
systems  used  in  physical  chemistry,  the  DST  can 
be  quantized  without  approximation.  Compari¬ 
sons  between  solutions  of  the  classical  and  quan¬ 
tum  versions  of  the  DST  are  thus  not  clouded  by 
questions  about  the  approximate  nature  of  the 
quantum  solutions.  Further,  the  exact-quantiza¬ 
tion  property  holds  not  only  for  the  DST  dimer 
but  also  for  DST  equations  representing  a  great 
variety  of  coupled-oscillator  systems  such  as  /- 
freedom  oscillator  chains  [15]  and  systems  of 
non-identical  oscillators  [16].  As  a  simple  model 
that  captures  some  fundamental  physics  without 
sacrificing  analytical  tractability,  the  DST  will 
continue  to  provide  insight  into  the  nonlinear 
and  quantum-mechanical  dynamics  of  molecules. 
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Conditions  for  integrability  of  a  modified  discrete  self-trapping  dimer  are  derived.  Dynamical  properties  of  three  special 
cases  are  investigated.  Blow-up  in  finite  time  may  occur  in  a  non-compact  system. 


1.  Introduction 

The  discrete  self-trapping  (DST)  equation  was 
introduced  in  [1]  as  a  general  model  for  energy 
self-trapping  in  molecules  and  molecular  crystals. 
The  DST  equation  is  only  integrable  for  two 
degrees  of  freedoms. 

The  DST  equation  includes  as  a  special  case 
the  discrete  nonlinear  Schrodinger  equation 
which  is  integrable  in  the  continuum  limit  too 
[2].  An  alternative  discretisation  of  the  nonlinear 
Schrodinger  (NLS)  equation  is  the  Ablowitz- 
Ladik  (AL)  equation  [3],  which  is  integrable  for 
any  number  of  freedoms. 

Several  generalisations  of  the  DST  equation 
have  been  studied  in  the  literature,  cf.  [4].  It  has 
recently  been  shown  in  [5]  that  a  Hamiltonian 
and  a  non-standard  Poisson  bracket,  depending 
on  a  deformation  parameter,  can  reproduce  the 
discrete  NLS  equation  and  the  AL  equation  in 
two  different  limits. 

In  this  article  we  consider  the  following  combi¬ 
nation  of  the  DST  equation  and  the  AL  equation 
with  two  degrees  of  freedom  and  periodic 
boundary  conditions: 

U ,  = 

-I-  6,|/1 ,1  Aj  +  ^1^2  , 

U2  =  a2|A,|M2  +  /32|A,|'A,  + 

+  52|A2|^A,  +€2A 


where  A ,  and  A 2  are  complex  functions  of  time, 
dot  denotes  differentiation  with  respect  to  time, 
and  the  coefficients  a, ,  /3, , .  .  .  ,  £2  sre  a  priori 
complex  constants.  Terms  of  the  form  y,|A  ,1^A ,, 
7'2|A2l^A2,  £,^2  and  £2^,  occur  in  the  DST 
system  [1],  terms  of  the  form  5,|AJ^A2, 
52|A2|^A,,  fijAj  and  ejA,  are  found  in  the  AL 
system  [3],  while  terms  of  the  form  ajlAjl^A, 
and  a2!'^il^A2  arise  in  the  coupling  between  two 
pulses  in  optical  fibres  [6,7].  The  remaining 
terms  are  new. 

The  aim  of  this  paper  is  to  study  the  integ¬ 
rability  of  various  special  cases  of  the  general 
system  (1).  In  the  next  section  we  derive  three 
special  cases  which  are  Hamiltonian  and  com¬ 
pletely  integrable.  In  section  three  we  investigate 
the  solutions  in  each  of  the  three  cases.  We  have 
explicitly  shown  that  one  of  the  cases  possesses 
solutions  which  are  not  globally  defined,  i.e., 
they  tend  to  infinity  in  finite  time  (blow-up). 


2.  Deriving  the  special  cases 

The  general  system  (1)  is  only  integrable  when 
it  has  two  conserved  quantities.  We  already  know 
two  special  cases  which  are  integrable,  the  DST 
and  the  AL  case.  The  DST  dimer  has  two 
conserved  quantities,  one  of  which  is  the  norm 
( 1 )  (or  number) 
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=  +  (2) 

Equivalently,  one  of  the  conserved  quantities  for 
the  AL  dimer  is 

+  (3) 

This  last  quantity  is  one  of  the  three  Feynman 

variables  introduced  in  [8],  hence  the  notation 

'•i- 

Since  the  system  (1)  is  more  general  than  the 
DST  and  the  AL,  we  consider  the  remaining  two 
Feynman  variables  too.  These  are 

r^  =  \{A,A\-A\A^),  (4) 

r3  =  lA,|^-|A,^  (5) 

We  then  demand  one  of  these  four  quantities  to 
be  conserved  and  thereby  derive  the  four  sets  of 
conditions  on  the  coefficients  of  (1)  given  in 
table  1.  The  four  quantities  N,  r,,  rj  and  rj 
satisfy  the  following  fundamental  relation; 

N"  =  r\  +  r\  +  r\.  (6) 

Using  this  relation  it  is  possible,  in  each  of  the 
four  cases,  to  rewrite  the  dynamical  system  (1)  in 
the  following  way  (“the  Feynman  top  formal¬ 
ism”) 

L  —  L  'x  to  ,  (7) 

where  L  is  a  three-component  vector  constructed 
from  the  three  remaining  (non-conserved)  quan- 

Table  1 

Conditions  on  coefficients. 

Conserved  quantity  Conditions  on  coefficients 
N  7,,  7-,  real;  Ima,  =  -Iina^; 

r,  7,  =a^  a,  =7^;  ImS,  = -ImSj; 

real;  e,,  fj  real 

r,  7,  =  a  * ,  a,  =  7  * ;  Re  5,  =  Re  82; 

/3,,/32  imaginary;  c,,e2  imaginary 

r,  7,,  72  real;  Im  a,  =  Im  02; 

=  ^,  =  -5*2;e,  =  -e*2 


tities,  and  <u  is  a  three-component  vector  related 
to  L  through  an  affine  map 

fti=C  +  DL,  (8) 

where  C  is  a  constant  vector  and  D  is  a  constant 
matrix.  The  vector  L  in  the  four  cases  is  given  in 
table  2.  In  each  case  L]  +  +  L]  is  constant 

due  to  the  fundamental  relation  (6),  in  agree¬ 
ment  with  eq.  (7).  Having  rewritten  the  dy¬ 
namical  system  in  the  form  of  (7)  and  (8)  it  is 
easy  to  construct  a  second  conserved  quantity 
using  an  analogy  with  classical  mechanics  [9]. 
Generalising  the  kinetic  energy  we  obtain  the 
quantity 

r  =  L^C  +  ^L^DL,  (9) 

where  ^  denotes  transposition.  This  quantity  is 
indeed  conserved,  provided  the  matrix  D  is 
symmetric.  The  conditions  for  symmetry  of  the 
matrix  D  in  each  case  is  reproduced  in  table  3. 
We  now  have  four  different  special  cases  of  (1), 
each  with  two  conserved  quantities.  The  final 
step  is  to  apply  a  gauge  transform  in  order  to 
simplify  the  equations.  The  gauge  transform 
used  is  of  the  form 


Table  2 
The  vector  L. 


Conserved  quantity 

L  =  (L,.  Z.,.  L,) 

N 

''i 

(N,  ir^/ir,) 

'"2 

(Af,  ir,,ir,) 

'■3 

{N,ir,,ir^) 

Table  3 

Conditions  for  two  conserved  quantities. 

Conserved  quantity 

Conditions  for  symmetry  of  D 

N 

a,  real,  =  5, 

r, 

7, ,  72  real.  Re  S,  =  Re  Sj 

''2 

a,,  7,  real,  Im5,  =  -ImS2 

'■3 

a,  real,  /S,  =  -6, 
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I 

Xexp(-i  J  (K,|/l,l'  +  K2|/l2|')d/)  ,  7  =  1,2. 

<0 

where  ly,  k,  and  Kj  are  real  constants.  Note  that 
this  transformation  preserves  the  form  of  the 
four  quantities  N,  r,,  r,  and  r^. 

By  examining  the  simplified  equations  it  is 
easily  shown  that  the  case  r2  =  0  can  be  mapped 
linearly  to  the  case  r,  =  0.  We  are  finally  left  with 
the  three  cases  N  =  0,  r,  =  0  and  Tj  =  0  [lOj.  In 
the  following  we  list  them  in  their  simplified 
form  together  with  the  two  conserved  quantities; 

Case  1 

The  dynamical  equations  are 

'^2  =  y2\^2\^A2  +  €*Ai  , 

where  y,  and  ^2  are  real  constants  and  e  is  a 
complex  constant.  The  two  conserved  quantities 
are 

This  system  reduces  to  the  DST  dimer  [1]  for 
y^  -  ^2  and  e  real.  For  y,  yj  the  system  is 
equivalent  [11]  with  the  non-resonant  DST  dimer 
studied  by  Scott  in  [12].  The  quantum  problem 
for  this  equation  is  studied  in  [13]. 

Case  2 

The  dynamical  equations  are 
^\  =  yi\Ai\^A^  +  eA2  , 

>^2  =  721^21^^2 ’ 

where  y,  and  y2  are  real  constants  and  e  is  a 
complex  constant.  The  two  conserved  quantities 
are 

r3  =  |/l,l'-|>l2l'. 


H = iy.M  .r  -  + «>»m2 + f  . 

It  can  be  shown  [11]  that  this  system  is  a 
specialisation  of  case  3.  In  [14]  a  Lax  operator 
satisfying  the  quadratic  Sklyanin  algebra  [15]  is 
given. 

Case  3 

The  dynamical  equations  are 
*^1  =  ■y|^ll^^2  +  /3ll^2l‘'^2  +  ^1-^2  '“-^1  > 

‘^2  =  Y|^2I^'^1  +^2l'^ll''^l  +  ^2-^  1  "  iO‘A2  , 

where  y,  jS,,  ^2^  ^2  “  ®re  real  constants. 

The  two  conserved  quantities  are 

rt  =  A,A*2  +A*A2, 

//  =  j/32M,r  +  ^^,|/t2r  +  €2|>l,|^  +  e,M2|’ 

+  y\Af\A2\^  +  iaiA,A*2-A\A2). 

For  /3,  =  /32  =  0,  e,  =  62  and  a  =  0  this  system 
reduces  to  the  AL  case  [3].  Choosing  instead 
y  =  j8,  =  /Sj  the  reduced  system  is  equivalent  to 
case  2.  However,  where  the  general  system,  case 
3,  possesses  blow-up  solutions,  this  is  not  so  for 
case  2. 

This  concludes  the  derivation  of  the  three 
special  cases  of  (1). 


3.  Properties  of  the  three  dynamical  systems 

In  this  section  we  will  examine  the  integrabili- 
ty  of  the  three  cases  and  investigate  the  prop¬ 
erties  of  the  solutions. 

All  three  cases  are  Hamiltonian  with  the 
second  conserved  quantity  H  as  the  Hamilto¬ 
nian.  The  Poisson  structure  is  standard  and  the 
canonical  variables  are  (iA*,  iA*,  A,,  A 2), 
(iA*,  A 2,  A,,  iA*)  and  (iA*,  iA*,  A 2,  A,) 
respectively. 

The  three  cases  are  all  completely  integrable 
[16],  Indeed,  defining  N,  =  |A  we  get  in  case 
1: 
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(A/,)- = -[H  -  iy.N;  -  - /V.)Y 

+  4|€|-N,(N-A/,),  (10) 

and  in  case  2; 

(yV,)-  =  -(//- j7,/V;  +  4y,(A/-A/,)-l^ 

+  4\e\-N,{N-N,).  (II) 

So  in  both  cases  (A/,  )Ms  a  fourth  degree  polyno¬ 
mial  in  yv,,  and  hence  the  solution  N^(t)  may  be 
expressed  in  terms  of  Jacobi  elliptic  functions 
[17). 

In  case  1  all  solutions  are  bounded  in  infinite 
time,  since  A/ =  |/4 ,|' +  l/f,!'  ‘s  a  constant.  In 
case  2  the  coefficient  to  N*  in  (11)  is  -5(7,  - 
y^)',  i.e.,  negative.  Hence  all  solutions  are 
bounded  for  Ti  73  •  For  7,  =  7,  the  right-hand 
side  of  (11)  becomes  a  second  degree  polynomial 
in  N^  with  the  highest  coefficient  4|€|'-7ir3. 
The  solutions  to  (11)  are  in  this  case  either 
sinusoidal  or  exponential,  and  hence  they  are 
bounded  in  finite  time. 

However,  in  case  3  the  solutions  can  generally 
not  be  written  in  terms  of  Jacobi  elliptic  func¬ 
tions.  Nor  are  the  solutions  globally  defined,  i.e., 
bounded  in  all  finite  time;  some  solutions  exhibit 
blow-up.  In  the  following  we  investigate  case  3  in 
more  detail. 

By  examining  the  non-compact  manifolds  r ,  = 
const,  and  H  =  const,  we  quickly  derive  sufficient 
conditions  for  bounded  solutions.  These  are 

-  either  7,  jS,  and  P2  have  the  same  sign  ,  ( 12) 

-or/3,^2>7\  (13) 

For  j8,  =  /S,  =  /3  and  e,  =  e,  =  e,  the  dynamical 
equations  for  case  3  can  be  written  as 

N  =  (7  -  +  2ar,  , 

^3  =  -2er,  -  , 

r,  =  (7 -I- /3)Ayr2  +  2aAy  +  2er2  ,  (14) 

where  we  have  reintroduced  the  Feynman  vari¬ 
ables  (2)-(5).  Recalling  that  =  r\  +  r\  +  r\, 
we  find  that 


(Nf  =  {N~  -  r]-  r;)[(7  -  /3)r3  3-  2a]^  ,  (15) 

where  r,  is  related  to  Hamiltonian  through 

H  =  irpN-  +  eN  +  US  -P)rl  +  or, 

+  \{S-P)r]  , 

with  r,  being  the  first  conserved  quantity. 

Simple  calculations  show  that  (N)^  can  be 
expressed  as 

(N)  =  P,{N)  +  R.my/Q.iN) ,  (16) 

where  P^{N)  is  a  fourth-degree  polynomial  in  A/, 
and  /?3(N)  and  Q2il^)  are  second  degree  polyno¬ 
mials.  For  a  =  0,  (A/)‘  becomes  a  fourth-degree 
polynomial  in  N.  Thus,  in  this  special  case  the 
Feynman  variable  N  can  be  written  in  terms  of 
Jacobi  elliptic  functions  [17].  In  the  general  case 
eq.  (16)  must  be  solved  numerically. 

Using  Mathematica  [18]  we  get  the  bounded 


Fig  1.  Solution  of  eqs.  (14)  with  (a)  y  =  2.  /3  -  1.  «  -0.5. 
A1(0)=  1.732.  r,(0)=  1,  r,(0)=  1,  r,  =  1;  (b)  y  =  -1.  ^  =  1, 
€  =  1,  a  =0.5.  N(0)=  1.  r;(0)  =  0.  /■,(0)  =  0.  r,  =  1. 
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solution  shown  in  hg.  la  for  parameter  values 
and  initial  conditions  as  indicated  in  agreement 
with  (12).  Alternatively,  fig.  lb  illustrates  an 
unbounded  solution.  Numerically,  the  blow-up 
was  found  to  occur  at  time  /  =  1.684. 

For  6=0  and  a  =  0,  eq.  (15)  reduces  to 

(N)^  =  (C,-2^N^)[(y-»-)8)N'-Q],  (17) 

where  the  conserved  quantities  are  given  by 

C,=2/3A=  +  (y-)8)rl, 

Q  =  (y+)8)A/^-(y-/3)^^ 

Equation  (17)  may  be  integrated  analytically  and 
the  solution  is 

X  iis(\/=w.  «„  -  0,  •  (18) 

where  ns(«,  k)  is  the  reciprocal  of  the  Jacobi 
elliptic  function  sn(u,  k)  of  argument  u  and 
modulus  k  [17].  In  (18),  tg  is  the  blow-up  time 
determined  by  the  initial  conditions  as 

^ _ 1 

V-2/3(^  +  y) 

J  _ dN _ 

(19) 

This  quantity  is  real  and  finite  for  y  >  -/3  >0  in 
agreement  with  the  failure  of  the  two  conditions 
(12)  and  (13). 

In  the  limit  which  corresponds  to  the 

AL  case,  the  blow-up  time  tg  approaches  infinity 
logarithmically  [16]. 


to  dedicate  this  article  in  honour  of  Professor 
Alwyn  C.  Scott’s  60th  birthday.  Vadim  B.  Kuz¬ 
netsov  is  thanked  for  his  remarks. 
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